SEQUENCE LISTING






(1) GENERAL INFORMATION 



(i) APPLICANT:

(A) NAME: Pasteur Merieux Serums et Vaccins

(B) STREET: 58, avenue Leclerc

(C) CITY: Lyons

(E) COUNTRY: France

(F) POSTAL CODE: 69007

(A) NAME: Transgene

(B) STREET: 11, rue de Molsheim

(C) CITY: Strasbourg

(E) COUNTRY: France

(F) POSTAL CODE: 67000




(ii) TITLE OF INVENTION: Tbp2 fragments of N. meningitidis

(iii) NUMBER OF SEQUENCES: 35

(iv) COMPUTER READABLE FORM:

(A) MEDIUM TYPE: Tape

(B) COMPUTER: IBM PC compatible

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: PatentIn Release #1.0, Version #1.25



(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2230 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Neisseria meningitidis

(B) STRAIN: IM2169

(ix) FEATURE:

(A) NAME/KEY: sig_peptide

(B) LOCATION: 60..119

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 120..2192

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 60..2192

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION: 120..1154

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION: 1155..1748

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION: 1749..2192

(ix) FEATURE:

(A) NAME/KEY: misc_binding

(B) LOCATION: 127..1169
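
The feature locations listed for SEQ ID NO: 1 can be cross-checked arithmetically: a location `a..b` covers b - a + 1 nucleotides, and a coding span should divide evenly into codons. The following is a minimal Python sketch of that check. The helper name `span_length` and the dictionary layout are illustrative assumptions, not part of the PatentIn format; only the three locations are taken from the listing.

```python
import re

def span_length(location: str) -> int:
    """Return the number of nucleotides covered by a 'start..end' location."""
    match = re.fullmatch(r"\s*(\d+)\s*\.\.\s*(\d+)\s*", location)
    if match is None:
        raise ValueError(f"not a simple start..end location: {location!r}")
    start, end = (int(x) for x in match.groups())
    return end - start + 1

# Locations copied from the SEQ ID NO: 1 feature table.
features = {
    "sig_peptide": "60..119",
    "mat_peptide": "120..2192",
    "CDS": "60..2192",
}

# Number of codons in each feature, and whether every span is a multiple of three.
codons = {name: span_length(loc) // 3 for name, loc in features.items()}
in_frame = all(span_length(loc) % 3 == 0 for loc in features.values())
```

Under these assumptions the signal and mature peptide codon counts add up to the CDS codon count, which agrees with the 711 amino acids stated for SEQ ID NO: 2.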
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:



atttgttaA^ aataaataaa ataataatcc ttatcattct ttaattgaat tgggtttat 

ATG AAC AA* CCA TTG GTA AAT CAG GCT GCT ATG GTG CTG CCT GTG TTT
Met Asn Asn Pro Leu Val Asn Gln Ala Ala Met Val Leu Pro Val Phe

-20 -15 -10 -5

TTG TTG ACT GCC TGT CTG GGC GCC GGC GGC ACT TTC GAT CTT GAT TCT
Leu Leu Ser Ala Cys Leu Gly Gly Gly Gly Ser Phe Asp Leu Asp Ser

1 5 10

GTC GAT ACC GAA GCC CCG CGT CCC CCG CCA AAG TAT CAA GAT GTT TCT
Val Asp Thr Glu Ala Pro Arg Pro Ala Pro Lys Tyr Gln Asp Val Ser

15 20 25

TCC GAA AAA CCG CAA GCC CAA AAA GAC CAA GGC GGA TAC GCT TTT GCG
Ser Glu Lys Pro Gln Ala Gln Lys Asp Gln Gly Gly Tyr Gly Phe Ala
30 35 40

ATG AGG TTG AAA CGG AGG AAT TGG TAT CCG GGG GCA GAA GAA AGC GAG
Met Arg Leu Lys Arg Arg Asn Trp Tyr Pro Gly Ala Glu Glu Ser Glu

45 50 55 60




GCG ACG GGA TTG CCG ACA AAA 
u Ala Thr Gly Leu Pro Thr Lys 
70 75 



GTT AAA CTG AAC GAG ACT GAT T< 
Val Lys Leu Asn Glu Ser Asp T 
65 

CCC AAG GAA CTT CCT AAA CGG «A AAA TCG GTT ATT GAA AAA GTA GAA 
Pro Lys Glu Leu Pro Lys Arg Gin Lys, Ser Val He Glu Lys Val Glu 

00 85\ 90 

ACA GAC GGC GAC AGC GAT ATT TAT TCT TCC CCC TAT CTC ACA CCA TCA 
Thr Asp Gly Asp Ser Asp He Tyr Ser Ser v Pro Tyr Leu Thr Pro Ser 
95 100 \ 105 



AAC CAT CAA AAC GGC AGC GCT GGC AAC GGT GTA AAT CAA 
Asn His Gin Asn Gly Ser Ala Gly Asn Gly Val\ Asn Gin 
110 115 x 120 

CAG GCA ACA GGT CAC GAA AAT TTC CAA TAT GTT TAT TCC 
Gin Ala Thr Gly His Glu Asn Phe Gin Tyr Val Tyr, Ser 
125 130 135 \ 

TAT AAA CAT GCA GCG ACT GAA AAA GAT TTC AGT AAC AAA 
Tyr Lys His Ala Ala Ser Glu Lys Asp Phe Ser Asn Lys, 
145 150 



\ 



CCT AAA AAT 
Pro Lys Asn 



GGT TGG TTT 
Gly Trp Phe 
140 

AAA ATT AAG 
Lys He Lys 

\ 155 



TCA GGC GAC GAT GGT TAT ATC TTC TAT CAC GGT GAA AAA CCT TCC CGA 
Ser Gly Asp Asp Gly Tyr He Phe Tyr His Gly Glu Lys Pro Ser Arg 
160 165 170\ 

CAA CTT CCT GCT TCT GGA AAA GTT ATC TAC AAA GGT GTG TGG CAT TTT 
Gin Leu Pro Ala Ser Gly Lys Val He Tyr Lys Gly Val Trp His Phe 
175 180 185 x 



59 
107 

155 

203 

251 

299 

347 

395 

443 

491 

539 

587 

635 

683 
GTA ACC GAT ACA AAA AAG GGT CAA GAT TTT CGT GAA ATT ATC CAG CCT
Val Thr Asp Thr Lys Lys Gly Gln Asp Phe Arg Glu Ile Ile Gln Pro
190 195 200



731 



AAA CAA GGC GAC AGG TAT AGC GGA TTT TCT GGT GAT GGC AGC 
ys Lys Gin Gly Asp Arg Tyr Ser Gly Phe Ser Gly Asp Gly Ser 
210 215 220 



779 



GAA GAA\TAT TCC AAC AAA AAC GAA TCC ACG CTG AAA GAT GAT CAC GAG 
Glu Glu Tyr Ser Asn Lys Asn Glu Ser Thr Leu Lys Asp Asp His Glu 
\ 225 230 235 



627 



GGT TAT GGT 
Gly Tyr Gly 



TTG ACG GGT 
Leu Thr Gly 
255 

AAT AAT GAC 
Asn Asn Asp 

270 

ACA GGC AAC 
Thr Gly Asn 
285 



lyTTT ACC 
PJhe Thr 
240 



TCG AAT TTA GAA GTG GAT TTC GGC AAT AAG AAA 
Ser Asn Leu Glu Val Asp Phe Gly Asn Lys Lys 
245 250 



AAA NTT A ATA CGC 
Lys Leu lie Arg 



AAA CAT^ACC ACC 
Lys His Thr Thr 

\ 275 



CGC TTC AAC GGC 
Arg Phe Asn Gly 
290 



AAT GAA ACC AAA CTA CAT CO 
Asn Glu Thr Lys Leu His Pro 
305 



AAT AAT GCG AGC CTA AAT AAT AAT ACT 
Asn Asn Ala Ser Leu Asn Asn Asn Thr 
260 265 

CAA TAC TAC AGC CTT GAT GCA CAA ATA 
Gin Tyr Tyr Ser Leu Asp Ale Gin lie 
280 

ACG GCA ACG GCA ACT GAC AAA AAA GAG 
Ale Thr Ala Thr Asp Lys Lys Glu 
295 300 

GTT TCC GAC TCG TCT TCT TTG AGC 
Val Ser Asp Ser Ser Ser Leu Ser 
310 315 




GGC GGC TTT TTC GGC CCG CAG GGT GAG GAA TTG GGT TTC CGC TTT TTG 
Gly Gly Phe Phe Gly Pro Gin Gly Glu Glu Leu Gly Phe Arg Phe Leu 

320 325\ 330 



875 



923 



971 



1019 



1067 



1115 



AGC GAC GAT CAA AAA GTT GCC GTT 
Ser Asp Asp Gin Lys Val Ala Val 
335 340 

AAA CTG GAA AAT GGC GCG GCG GCT 
Lys Leu Glu Asn Gly Ala Ala Ala 

350 355 

TCG GGC GGT GCG GCA GGC ACG TCG 
Ser Gly Gly Ala Ala Gly Thr Ser 
365 370 



GTC GGC AGC GCG 
Val Gly Ser Ala 



Lys 
345 



TCA GGC AGC ACA GGT 
Ser Gly Ser v Thr Gly 



360 



\ 



TCT GAA AAC AGT AAG 
Ser Glu Asn Ser Lys 

375 \ 



ACC AAA GAC 
Thr Lys Asp 



GCG GCA GCA 
Ala Ala Ala 



CTG ACC ACG 
Leu Thr Thr 
380 



1163 



1211 



1259 



GTT TTG GAT GCG GTT GAA TTG ACA CTA AAC GAC AAG AAA ATC AAA AAT 
Val Leu Asp Ala Val Glu Leu Thr Leu Asn Asp Lys Lys He Lys Asn 

385 390 \ 395 



CTC GAC AAC TTC AGC AAT GCC GCC CAA CTG GTT GTC GAC GGC ATT ATG 

Leu Asp Asn Phe Ser Asn Ala Ala Gin Leu Val Val Asp Gly He Met 

400 405 410^ 

ATT CCG CTC CTG CCC AAG GAT TCC GAA AGC GGG AAC ACT CAG GCA GAT 

He Pro Leu Leu Pro Lys Asp Ser Glu Ser Gly Asn Thr Gin Ala Asp 
415 420 425 \ 



1307 



1355 



1403 



AAA GGT AAA AAC GGC GGA ACA GAA TTT ACC CGC AAA TTT GAA CAC ACG 
Lys Gly Lys Asn Gly Gly Thr Glu Phe Thr Arg Lys Phe Glu His Thr 
430 435 440 % 



1451 
CCG GAA ACT GAT
Pro Glu Ser Asp
445

ACC GCT 
Thr Ala 



ACC TAT 
Thr Tyr 



GGA ATG TTG ACG 
Gly Met Leu Thr 
495 \ 

AAC ACT ACT GAA 
Asn Ser Ser Gin 
510 

TTC CTC CAA GGC 
Phe Leu Gin Gly 
525 

AAC GTC GTT TAT 
Asn Val Val Tyr 



AGC TGG AGC GGC 
Ser Trp Ser Gly 
560 

TTT ACT GTG AAT 
Phe Thr Val Asn 

575 

GAA AAC AGG CAG 
Glu Asn Arg Gin 
590 

AAC GGC TTT GAA 
Asn Gly Phe Glu 
605 

GAT CAA AAA AAT 
Asp Gin Lys Asn 



AAG GTA AAG GGC 
Lys Val Lys Gly 
640 

TGG TTT GCC TAT 
Trp Phe Ala Tyr 
655 

TCC AGC GAT GGA 
Ser Ser Asp Gly 
670 

AAA CGC CAA CAG 
Lys Arg Gin Gin 
685 
AAA AAA GAC GCC CAA GCA GCT ACG CAG ACG AAT GGG
Lys Lys Asp Ala Gln Ala Gly Thr Gln Thr Asn Gly
450 455 460 

TCA AAT ACG GCA GCT GAT ACC AAT GGC AAA ACA AAA 
Ser Asn Thr Ala Gly Asp Thr Asn Gly Lys Thr Lys 
465 470 475 

GAA GTC TGC TCT TCC AAC CTC AAT TAT CTG AAA TAC 
Glu Val Cys Cys Ser Asn Leu Asn Tyr Leu Lys Tyr 
485 490 

CGC AAA AAC AGC AAG TCC GCC ATG CAG GCA GGA GGA 
Arg Lys Asn Ser Lys Ser Ala Met Gin Ala Gly Gly 

500 505 

^GCT GAT GCT AAA ACG GAA CAA GTT GAA CAA ACT ATG 
Ala Asp Ala Lys Thr Glu Gin Val Glu Gin Ser Met 
515 520 



GAG CCT ACC GAT GAA AAA GAG ATT CCA ACC GAC CAA 
Glu Arg Thr Asp Glu Lys Glu He Pro Thr Asp Gin 
530\^ 535 540 

CGG GGG TCT TGG TAC GGG CAT ATT GCC AAC GGC ACA 
Arg Gly SerxTrp Tyr Gly His He Ala Asn Gly Thr 
545 \ /} 550 555 



AAT GCT TCT 
Asn Ala Ser 




GAG GGC GGC AAC AGG GCG GAA 
rp Lys Glu Gly Gly Asn Arg Ale Glu 



565 



570 



TTT GCC GAT AAA AAA\ ATT ACC GGC AAG TTA ACC GCT 
Phe Ala Asp Lys Lys He Thr Gly Lys Leu Thr Ala 
580 \ 585 

GCG CAA ACC TTT ACC ATT\GAG GGA ATG ATT CAG GCC 
Ala Gin Thr Phe Thr He Glu Gly Met He Gin Gly 
595 \ 600 

GCT ACG GCG AAA ACT CCT GAG\TCA GGT TTT GAT CTC 
Gly Thr Ala Lys Thr Ala Glu Ser Gly Phe Asp Leu 
610 615 \ 620 

ACC ACC CGC ACG CCT AAG GCA TATvATC ACA GAT GCC 
Thr Thr Arg Thr Pro Lys Ala Tyr He Thr Asp Ala 
625 €30 \ 635 

GGT TTT TAC GGG CCT AAA GCC GAA GAG, TTG GGC GGA 
Gly Phe Tyr Gly Pro Lys Ala Glu Glu^Leu Gly Gly 
645 650 

CCG GGC GAT AAA CAA ACG GAA AAG GCA ACA GCT ACA 
Pro Gly Asp Lys Gin Thr Glu Lys Ala ThrNAla Thr 
660 665 

AAT TCA GCA AGC AGC GCG ACC GTG GTA TTC GCT GCG 
Asn Ser Ala Ser Ser Ala Thr Val Val Phe Gly\Ala 
675 680 

CCT CTG CAA TAAGCACGCT TGCCGAACAA TCAAGAATAA 
Pro Val Gin 
690 



1499 



1547 



1595 



1643 



1691 



1739 



1787 



1835 



1883 



1931 



1979 



2027 



2075 



2123 
(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 711 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:



PCT/FR95/00701 

2230 



Met Asn Asn Pro Leu Val Asn Gln Ala Ala Met Val Leu Pro Val Phe
-20 -15 -10 -5

Leu Leu Ser Ala Cys Leu Gly Gly Gly Gly Ser Phe Asp Leu Asp Ser
1 5 10

Val Asp Thr Glu Ala Pro Arg Pro Ala Pro Lys Tyr Gln Asp Val Ser
15 20 25

Ser Glu Lys Pro Gln Ala Gln Lys Asp Gln Gly Gly Tyr Gly Phe Ala
30 35 40

Met Arg Leu Lys Arg Arg Asn Trp Tyr Pro Gly Ala Glu Glu Ser Glu
45 50 55 60

Val Lys Leu Asn Glu Ser Asp Trp Glu Ala Thr Gly Leu Pro Thr Lys
65 70 75

Pro Lys Glu Leu Pro Lys Arg Gln Lys Ser Val Ile Glu Lys Val Glu
80 85 90

Thr Asp Gly Asp Ser Asp Ile Tyr Ser Ser Pro Tyr Leu Thr Pro Ser
95 100 105

Asn His Gln Asn Gly Ser Ala Gly Asn Gly Val Asn Gln Pro Lys Asn
110 115 120

Gln Ala Thr Gly His Glu Asn Phe Gln Tyr Val Tyr Ser Gly Trp Phe
125 130 135 140

Tyr Lys His Ala Ala Ser Glu Lys Asp Phe Ser Asn Lys Lys Ile Lys
145 150 155

Ser Gly Asp Asp Gly Tyr Ile Phe Tyr His Gly Glu Lys Pro Ser Arg
160 165 170

Gln Leu Pro Ala Ser Gly Lys Val Ile Tyr Lys Gly Val Trp His Phe
175 180 185

Val Thr Asp Thr Lys Lys Gly Gln Asp Phe Arg Glu Ile Ile Gln Pro
190 195 200

Ser Lys Lys Gln Gly Asp Arg Tyr Ser Gly Phe Ser Gly Asp Gly Ser
205 210 215 220

Glu Glu Tyr Ser Asn Lys Asn Glu Ser Thr Leu Lys Asp Asp His Glu
225 230 235
Gly Tyr Gly Phe Thr Ser Asn Leu Glu Val Asp Phe Gly Asn Lys Lys
240 245 250

Leu Thr Gly Lys Leu Ile Arg Asn Asn Ala Ser Leu Asn Asn Asn Thr
255 260 265

Asn Asn Asp Lys His Thr Thr Gln Tyr Tyr Ser Leu Asp Ala Gln Ile
270 275 280

Thr Gly Asn Arg Phe Asn Gly Thr Ala Thr Ala Thr Asp Lys Lys Glu
285 290 295 300

Asn Glu Thr Lys Leu His Pro Phe Val Ser Asp Ser Ser Ser Leu Ser
305 310 315

Gly Gly Phe Phe Gly Pro Gln Gly Glu Glu Leu Gly Phe Arg Phe Leu
320 325 330

Ser Asp Asp Gln Lys Val Ala Val Val Gly Ser Ala Lys Thr Lys Asp
335 340 345

Lys Leu Glu Asn Gly Ala Ala Ala Ser Gly Ser Thr Gly Ala Ala Ala
350 355 360

Ser Gly Gly Ala Ala Gly Thr Ser Ser Glu Asn Ser Lys Leu Thr Thr
365 370 375 380

Val Leu Asp Ala Val Glu Leu Thr Leu Asn Asp Lys Lys Ile Lys Asn
385 390 395

Leu Asp Asn Phe Ser Asn Ala Ala Gln Leu Val Val Asp Gly Ile Met
400 405 410

Ile Pro Leu Leu Pro Lys Asp Ser Glu Ser Gly Asn Thr Gln Ala Asp
415 420 425

Lys Gly Lys Asn Gly Gly Thr Glu Phe Thr Arg Lys Phe Glu His Thr
430 435 440

Pro Glu Ser Asp Lys Lys Asp Ala Gln Ala Gly Thr Gln Thr Asn Gly
445 450 455 460

Ala Gln Thr Ala Ser Asn Thr Ala Gly Asp Thr Asn Gly Lys Thr Lys
465 470 475

Thr Tyr Glu Val Glu Val Cys Cys Ser Asn Leu Asn Tyr Leu Lys Tyr
480 485 490

Gly Met Leu Thr Arg Lys Asn Ser Lys Ser Ala Met Gln Ala Gly Gly
495 500 505

Asn Ser Ser Gln Ala Asp Ala Lys Thr Glu Gln Val Glu Gln Ser Met
510 515 520

Phe Leu Gln Gly Glu Arg Thr Asp Glu Lys Glu Ile Pro Thr Asp Gln
525 530 535 540

Asn Val Val Tyr Arg Gly Ser Trp Tyr Gly His Ile Ala Asn Gly Thr
545 550 555

Ser Trp Ser Gly Asn Ala Ser Asp Lys Glu Gly Gly Asn Arg Ala Glu
560 565 570
Phe Thr Val Asn Phe Ala Asp Lys Lys Ile Thr Gly Lys Leu Thr Ala
575 580 585

Glu Asn Arg Gln Ala Gln Thr Phe Thr Ile Glu Gly Met Ile Gln Gly
590 595 600

Asn Gly Phe Glu Gly Thr Ala Lys Thr Ala Glu Ser Gly Phe Asp Leu
605 610 615 620

Asp Gln Lys Asn Thr Thr Arg Thr Pro Lys Ala Tyr Ile Thr Asp Ala
625 630 635

Lys Val Lys Gly Gly Phe Tyr Gly Pro Lys Ala Glu Glu Leu Gly Gly
640 645 650

Trp Phe Ala Tyr Pro Gly Asp Lys Gln Thr Glu Lys Ala Thr Ala Thr
655 660 665

Ser Ser Asp Gly Asn Ser Ala Ser Ser Ala Thr Val Val Phe Gly Ala
670 675 680

Lys Arg Gln Gln Pro Val Gln
685 690




(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1808 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(vi) ORIGINAL SOURCE:

(A) ORGANISM: N. meningitidis

(B) STRAIN: IM2394

(ix) FEATURE:

(A) NAME/KEY: sig_peptide

(B) LOCATION: 1..60

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 61..1797

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..1797

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION: 61..1035

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION: 1036..1386

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION: 1387..1797

(ix) FEATURE:

(A) NAME/KEY: misc_binding

(B) LOCATION: 46..1050
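
The residue numbers printed under the sequences in this listing count from the first residue of the mature peptide, with the signal peptide numbered backwards from -1. A minimal Python sketch of that mapping follows, assuming the numbering convention just described (no residue 0). The function name `residue_number` and its parameters are hypothetical and used only for illustration; the start positions in the usage note come from the feature tables.

```python
def residue_number(nt_pos: int, cds_start: int, mat_start: int) -> int:
    """Map a nucleotide position inside the CDS to the printed residue number.

    Signal-peptide residues receive negative numbers ending at -1;
    the first mature-peptide residue is 1 (there is no residue 0).
    """
    if nt_pos < cds_start:
        raise ValueError("position lies upstream of the CDS")
    codon_index = (nt_pos - cds_start) // 3        # 0-based codon within the CDS
    signal_codons = (mat_start - cds_start) // 3   # codons preceding the mature peptide
    offset = codon_index - signal_codons
    return offset + 1 if offset >= 0 else offset
```

For SEQ ID NO: 3 (CDS starting at 1, mature peptide at 61), `residue_number(1, 1, 61)` gives -20 for the initiating Met and `residue_number(61, 1, 61)` gives 1 for the first mature residue.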
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:



ATG AAC AAT 
Met Asn Asn 

-20 



V TTG CTA AAT CAG CCT OCT ATG GTG CTG CCT CTG TTT 
,eu Val Aan Gin Ala Ala Hat Val Leu Pro Val Phe 

-15 -10 



TTG TTG ACT GCT TgAcTG GGT GGC GGC CGC ACT TTC GAT TTG GAC AGC 
Leu Lau Ser Ala Cys \au Cly Gly Gly Gly Sar Pha Asp Leu Asp Ser 

1 \ 5 10 



GTG GAA ACC GTG CAA GAT\ 
Val Glu Thr Val Gin Asp 
IS 



PVTG CAC TCC AM CCT AAG TAT GAG GAT GAA 
t His Sar Lys Pro Lys Tyr Glu Asp Glu 
20 25 



AAA AGC CAG CCT GAA AGC CAAXCAG GAT GTA TCG GAA AAC AGC GGC CCG 

Lys Ser Gin Pro Glu Ser Gln>Gl n *«P V * A S * r GXu Asn Ser tty 
30 35 \ 40 . 

GCT TAT GGC TTT GCA GTA AAA CTXvCCT CGC CGG AAT CCA CAT TTT AAT 
Ala Tyr Gly Pha Ala Val Lys Lau \ro Ax 9 Arg Asn Ala His Pha Aan 
45 50 \yO 55 

GGT TCA ATG CAT TCG AAA 
|u Gly Sar Mac Asp Trp Lys 
75 



CCT AAA TAT AAG GAA AAG CAC AAA CC 
Pro Lys Tyr Lys Glu Lys His Lys Pi 
65 



AAA CTG CAA AGA GGA GAA CCA AAT ACT TTT 
Lys Leu Gin Arg Gly Glu Pro Asn Sar Ph« 
80 »5 

GAA AAA AAA CGG GGT ACT TCT GAA CTT ATT GAA 
Glu Lys Lys Arg Gly Sar Ser Glu Leu lie Glu 
95 100 



GAG AGG GAT GAA TTG 
Glu Arg Asp Glu Lau 
90 

AAA TCG GAA GAT 
Lys Trp Glu Asp 
105 



GGG CAA ACT CCT GTA CTT GCT TAT ACA AAT TTC ACT ^T CTC CCT TCG 
Gly Gin Ser Arg Val Val Gly Tyr Thr Asn Pha Thr Tyr Val Arg Ser 
110 115 120 



KT AAT ATA 
Asn He 
140 



GGA TAT CTT TAC CTT AAT AAA AAT AAT ATT GAT ATT AAG 
Gly Tyr Val Tyr Leu Asn Lys Asn Asn He Asp He Lys 

125 130 135 

GTT CTT TTT GGA CCT GAC GGA TAT CTT TAC TAT AAA GGG AAA 
Val Leu Phe Gly Pro Asp Gly Tyr Lau Tyr Tyr Lys Gly Lys 
145 150 

TCC AAG GAG CTG CCA TCG GAA AAG ATA ACT TAT AAA GGT ACT TGG 
Ser Lys Glu Leu Pro Ser Glu Lys He Thr Tyr Lys Gly Thr Trp 
160 165 170 

TAT GTT ACT GAT GCT ATG GAA AAA CAA AGG TTT GAA GGA TTG GCT ACT > 
Tvr Val Thr Asp Ala Met Glu Lys Gin Arg Phe Glu Gly Leu Gly Ser 
175 180 185 

CCA GCA GGA GGA GAT AAA TCG GGG GCG TTG TCT GCA TTA GAA GAA GGG 
Ala Ala Gly Gly Asp Lys Sar Gly Ala Lau Sar Ala Lau Glu Glu Gly 
190 195 200 

GTA TTG CCT AAT CAG GCA GAG GCA TCA TCC GGT CAT ACC GAT TTT GCT 
Val Leu Arg Asn Gin Ala Glu Ala Ser Ser Gly His Thr Asp Phe Gly 
205 210 215 220 



48 



96 



144 



192 



240 



288 



336 



384 



432 



480 



528 



576 



624 



672 
ATG ACT ACT GAG TTT GAG CTT GAT TTT TCT GAT AAA ACA ATA AAG CGC
Met Thr Ser Glu Phe Glu Val Asp Phe Ser Asp Lys Thr Ile Lys Gly

225 230 235

ACA >CTT TAT CGT AAC AAC CGT ATT ACT CAA AAT AAT ACT GAA AAC AAA 
Thr Deu Tyr Arg Asn Asn Arg lis Thr Gin Asn Asn Ser Glu Asn Lys 
240 245 250 

CAA ATA AAA ACT ACG CGT TAC ACC ATT CAA GCA ACT CTT CAC GGC AAC 
Gin He ty« Thr Thr Arg Tyr Thr He Gin Ale Thr Leu His Gly Asn 

260 265 



768 



816 



864 



CGT TTC AAANcGT AAG GCG TTG GCG GCA GAT AAA CGT GCA ACA AAT GGA 
Arg Phe Lys Gly Lys Ala Leu Als Ala Asp Lys Gly Ala Thr Asn Gly 

270 \ 275 280 



ACT CAT 
Ser His 

285 



CCC TTT 
Pro Ph< 



iTT TCC GAC TCC GAC ACT TTG GAA GGC GGA TTT TAC 
Ser Asp Ser Asp Ser Leu Glu Gly Gly Phe Tyr 
290 295 300 



te lie ■ 



GGG CCC AAA GGC GAG N GAA CTT GCC GGT AAA TTC TTG AGC AAC GAC AAC 
Gly Pro Lys Gly Glu Glu Leu Ala Gly Lys Phe Leu Ser Asn Asp Asn 

305 \ 310 



AAA GTT GCA GCG GTG TTT 
Lys Val Ala Ala Val Phe 

320 




315 



GCG AAG CAC AAA GAT AAG AAC GAT GGG 
Gly Ala Lys Gin Lys Asp Lys Lys Asp Gly 

325 330 



GAA AAC GCG GCA GGG CCT 
Glu Asn Ala Ala Gly Pro Ala 1 
335 



GAA ACC GTG ATA GAT GCA TAC CCT 
for Glu Thr Val lie Asp Ala Tyr Arg 
340 345 



ATT ACC GGC GAG GAG TTT AAG AAA GAG CAA ATA GAC ACT TTT GGA GAT 
He Thr Gly Glu Glu Phe Lys Lys Glu Gin Zle Asp Ser Phe Gly Asp 

350 355 \ 360 

GTG AAA AAG CTG CTG CTT GAC GGA GTG GAG CTT TCA CTG CTC CCC TCT 
Val Lys Lys Leu Leu Val Asp Gly Val Glu Leu Ser Leu Leu Pro Ser 

365 370 \375 380 



GAG GGC AAT AAG GCG GCA TTT CAG CAC GAC 
Glu Gly Asn Lys Ala Ala Phe Gin His Glu 
385 390 

AAG GCA ACC CTG TGT TCT TCC AAC TTG GAT 
Lys Ala Thr Val Cys Cys Ser Asn Leu Asp 
400 *05 

CTG TCA AAA GAA AAT AAA GAC GAT ATG TTC 
Leu Ser Lys Glu Asn Lys Asp Asp Met Pha 
415 420 

CCA GTA TCC GAT CTG GCG GCA AGG ACG GAG 
Pro Vel Ser Asp Val Ala Ala Arg Thr Glu 
430 435 

GGT ACT TCG TAC GGA TAT ATT GCC AAC GGC 
Gly Thr Trp Tyr Gly Tyr He Ala Asn Gly 
445 450 



ATT GAG CAA AAC 
Glu Gin Asn 



H N e^GJ 



TAC ATG ACT 
Mat 



Tyr 



Ser 



\ 



TTT 
Phe 
410 



CTG CAA GGT CTC 
Leu Gin Gly Val 
425 

\ 



GCA AAC 
Ala Asn 
440 

ACA AGC 
Thr Ser 
455 



GCC\AAA 
Ala Lys 



\ 



GGC GTG 
Gly Val 
395 

GGG AAG 
Gly Lys 



CCC ACT 
Arg Thr 



TAT CGC 
Tyr Arg 



TGG AGC, 
Trp Ser 



CGC GAA 
VGly Glu 
\ 460 



GCC TCC AAT CAG GAA CCT GGT AAT AGG GCA GAG TTT GAC GTG GAT TTT 
Ala Ser Asn Gin Glu Gly Gly Asn Arg Ala Glu Phe Asp Val Asp Phe 
465 470 475\ 



912 



960 



1008 



1056 



1104 



1152 



1200 



1248 



1296 



1344 



1392 



1440 



1498 
ACT AAA AAA ATC ACT GGC ACA CTG ACG GGA AAA GAC CCT ACG TCT
Thr Lys Lys Ile Ser Gly Thr Leu Thr Ala Lys Asp Arg Thr Ser
480 485 490

CC TTT ACT ATT ACT CCC ATC ATT AAG GAC AAC CGT TTT TCA GOT 
Phe Thr He Thr Ala Het He Lya Asp Asn Gly Phe Ser Gly 
493 500 505 



GTG GCG 
Val Ala 
510 



^ ACC GGT GAA AAC GGC TTT GCG CTG GAT CCG CAA AAT ACC 
tra Thr Gly Glu Aan Gly Phe Ala Leu Asp Pro Gin Asn Thr 
* 515 520 



GGA AAT TCC 
Gly Asn Ser 

525 



TAT ACG CAT ATT GAA CCC ACT GTA TCC GGC GGT TTC 
Tyr Thr His He Glu Ala Thr Val Ser Gly Gly Phe 
530 535 540 




TAC GGC AAA AAC 

Tyr Gly Lys Asn 



ATC GAG ATG GGC GGA TCG TTC TCA TTT CCG GGA 
He Glu Het Gly Gly Ser Phe Ser Phe Pro Gly 
550 555 



AAT GCA CCA GAG GGA AAA CAA GAA AAA GCA TCG GTG GTA TTC CGT GCG 
Asn Ala Pro Glu Gly L?a Gin Glu Lys Ala Ser Val Val Phe Gly Ala 
560 \ 565 570 



CCC CAA GAG CTT GTG 
Lys Arg Gin Gin Leu Val 

575 



TAAGCACGGC T 



1536 



1584 



1632 



1680 



1728 



1776 



1808 



(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 599 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:



Met Asn Asn Pro Leu Val Asn Gln Ala Ala Met Val Leu Pro Val Phe
-20 -15 -10 -5

Leu Leu Ser Ala Cys Leu Gly Gly Gly Gly Ser Phe Asp Leu Asp Ser
1 5 10

Val Glu Thr Val Gln Asp Met His Ser Lys Pro Lys Tyr Glu Asp Glu
15 20 25

Lys Ser Gln Pro Glu Ser Gln Gln Asp Val Ser Glu Asn Ser Gly Ala
30 35 40

Ala Tyr Gly Phe Ala Val Lys Leu Pro Arg Arg Asn Ala His Phe Asn
45 50 55 60

Pro Lys Tyr Lys Glu Lys His Lys Pro Leu Gly Ser Met Asp Trp Lys
65 70 75

Lys Leu Gln Arg Gly Glu Pro Asn Ser Phe Ser Glu Arg Asp Glu Leu
80 85 90

Glu Lys Lys Arg Gly Ser Ser Glu Leu Ile Glu Ser Lys Trp Glu Asp
95 100 105
Gly Gln Ser Arg Val Val Gly Tyr Thr Asn Phe Thr Tyr Val Arg Ser
110 115 120

Gly Tyr Val Tyr Leu Asn Lys Asn Asn Ile Asp Ile Lys Asn Asn Ile
125 130 135 140

Val Leu Phe Gly Pro Asp Gly Tyr Leu Tyr Tyr Lys Gly Lys Glu Pro
145 150 155

Ser Lys Glu Leu Pro Ser Glu Lys Ile Thr Tyr Lys Gly Thr Trp Asp
160 165 170

Tyr Val Thr Asp Ala Met Glu Lys Gln Arg Phe Glu Gly Leu Gly Ser
175 180 185

Ala Ala Gly Gly Asp Lys Ser Gly Ala Leu Ser Ala Leu Glu Glu Gly
190 195 200

Val Leu Arg Asn Gln Ala Glu Ala Ser Ser Gly His Thr Asp Phe Gly
205 210 215 220

Met Thr Ser Glu Phe Glu Val Asp Phe Ser Asp Lys Thr Ile Lys Gly
225 230 235

Thr Leu Tyr Arg Asn Asn Arg Ile Thr Gln Asn Asn Ser Glu Asn Lys
240 245 250

Gln Ile Lys Thr Thr Arg Tyr Thr Ile Gln Ala Thr Leu His Gly Asn
255 260 265

Arg Phe Lys Gly Lys Ala Leu Ala Ala Asp Lys Gly Ala Thr Asn Gly
270 275 280

Ser His Pro Phe Ile Ser Asp Ser Asp Ser Leu Glu Gly Gly Phe Tyr
285 290 295 300

Gly Pro Lys Gly Glu Glu Leu Ala Gly Lys Phe Leu Ser Asn Asp Asn
305 310 315

Lys Val Ala Ala Val Phe Gly Ala Lys Gln Lys Asp Lys Lys Asp Gly
320 325 330

Glu Asn Ala Ala Gly Pro Ala Thr Glu Thr Val Ile Asp Ala Tyr Arg
335 340 345

Ile Thr Gly Glu Glu Phe Lys Lys Glu Gln Ile Asp Ser Phe Gly Asp
350 355 360

Val Lys Lys Leu Leu Val Asp Gly Val Glu Leu Ser Leu Leu Pro Ser
365 370 375 380

Glu Gly Asn Lys Ala Ala Phe Gln His Glu Ile Glu Gln Asn Gly Val
385 390 395

Lys Ala Thr Val Cys Cys Ser Asn Leu Asp Tyr Met Ser Phe Gly Lys
400 405 410

Leu Ser Lys Glu Asn Lys Asp Asp Met Phe Leu Gln Gly Val Arg Thr
415 420 425

Pro Val Ser Asp Val Ala Ala Arg Thr Glu Ala Asn Ala Lys Tyr Arg
430 435 440
Gly Thr Trp Tyr Gly Tyr Ile Ala Asn Gly Thr Ser Trp Ser Gly Glu
445 450 455 460

Ala Ser Asn Gln Glu Gly Gly Asn Arg Ala Glu Phe Asp Val Asp Phe
465 470 475

Ser Thr Lys Lys Ile Ser Gly Thr Leu Thr Ala Lys Asp Arg Thr Ser
480 485 490

Pro Ala Phe Thr Ile Thr Ala Met Ile Lys Asp Asn Gly Phe Ser Gly
495 500 505

Val Ala Lys Thr Gly Glu Asn Gly Phe Ala Leu Asp Pro Gln Asn Thr
510 515 520

Gly Asn Ser His Tyr Thr His Ile Glu Ala Thr Val Ser Gly Gly Phe
525 530 535 540

Tyr Gly Lys Asn Ala Ile Glu Met Gly Gly Ser Phe Ser Phe Pro Gly
545 550 555

Asn Ala Pro Glu Gly Lys Gln Glu Lys Ala Ser Val Val Phe Gly Ala
560 565 570

Lys Arg Gln Gln Leu Val Gln
575

(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2255 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(vi) ORIGINAL SOURCE:

(A) ORGANISM: N. meningitidis

(B) STRAIN: M978

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 1..2115

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..2115

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:



TGT CTG GGT GGC GGC GGC ACG TTC GAT CTT GAT TCT GTC GAT 

Cys Leu Gly Gly Gly Gly Thr Phe Asp Leu Asp Ser Val Asp 
15 10 

GCC CCG CGT CCC GCC CCA AAA TAT CAA GAT GTT TCT TCC GAA AAA Cfc 

Ala Pro Arg Pro Ala Pro Lys Tyr Gin Asp Val Ser Ser Glu Lys 
20 25 30 

CAA GCC CAA AAA GAC CAA GGC GGA TAC GGT TTT GCA ATG CGC CTC AAG 

Gin Ala Gin Lys Asp Gin Gly Gly Tyr Gly Phe Ala Met Arg Leu Lys 
35 40 45 



48 



96 



144 
CG& COG AAT TCC CAT CCC CAC CCA AAT CCT AAA GAA GAT GAG ATA AAA
Arg Arg Asn Trp His Pro Gln Ala Asn Pro Lys Glu Asp Glu Ile Lys

55 60 



192 



CTT TC\ GAA AAT GAT TCC GAG GCG ACA GGA TTG CCA CCC AAT CCC AAA 
Leu Ser\Glu Asn Asp Trp Glu Ala Thr Gly Leu Pro Cly Asn Pro Lys 
65 \ 70 75 BO 



240 



AAC TTA CCTv GAG CGA CAG AAA TCG GTT ATT GAA AAA GTA AAA ACA CCC 
Asn Leu ProXJlu Arg Gin Lys Ser Val He Glu Lys Val Lys Thr Gly 
85 90 95 



286 



AGC GAC AGC AAT 
Ser Asp Ser Asn 
100 



VTT TAT TCT TCC CCC TAT CTC ACG CAA TCA AAC CAT 
Tyr Ser Ser Pro Tyr Leu Thr Gin Ser Asn His 
105 110 



336 



CAA AAC 

Gin Asn 



GAG TTC 
Glu Phe 
130 



GGC ACT 
Gly Ser 
115 

AAA TAT 

Lys Tyr 



GGA AAC CAA CCA AAA 

Ala Asn Gin Pro Lys 
120 



*t\cc 



GTT TAT tCC CCT TCG 

Val Tyr Ser Gly Trp 

13S\ 



AAT GAA 

Asn Glu 



TTT TAC 

Phe Tyr 



GTA AAA 
Val Lys 

125 

AAA CAC 
Lys His 
140 



GAT TAT AAA 
Asp Tyr Lys 

GCT AAA CTC 
Ala Lys Leu 



384 



432 



GAA ATC 
Glu He 
145 

GAC GGT 
Asp Gly 



.GTT TCT 
Val Ser 



ACG AAA 

Thr Lys 



ATA AAA 

He Lys 



TAT ATC 
Tyr He 



GGA GAA 
Gly Glu 
180 

CAG GGA 

Gin Gly 
195 



GAA AAC AAC TTA ATT 

Glu Asn Asn Leu He 
150 



AAG GGT 

Lys Gly 
155 



TTT TAT CAC GCT 
Phe Tyr His Gly Gl 
165 

GTT ACC TAC AAA GGC 
Val Thr Tyr Lys Gly 
185 

CAA AAA TTT AAC GAT 
Gin Lys Phe Asn Asp 

200 




ATT CTT 
He Leu\ 



GCA AAG 
Ala Lys 

TCC CGA 
Ser Arg 



CAT TTT 
His Phe 



GGA ACC 

Cly Thr 
205 



AGC GGC GAC 

Ser Gly Asp 
160 

CAA CTT CCC 
Gin Leu Pro 
175 

GTA ACC GAT 
Val Thr Asp 

190 

TCA AAA AAA 
Ser Lys Lys 



480 



528 



576 



624 



CAA GGC GAC AGG TAT AGC GGA TTT CCG GGT GAT GAC GGC GAA GAA TAT 

Gin Gly Asp Arg Tyr Ser Gly Phe Pro Gly Asp As& Gly Glu Glu Tyr 
210 215 220 



672 



TCC AAT AAA AAT GAA GCG ACT TTA CAA GGC AGT CAA GAG GGT TAT GGT 
Ser Asn Lys Asn Glu Ala Thr Leu Gin Gly Ser Gin Glu Gly Tyr Gly 
225 230 235 \ 240 



TTT ACC TCA AAT TTA AAA GTG GAT 
Phe Thr Ser Asn Leu Lys Val Asp 
245 

GAA TTG ATA CGC AAT AAT AGA GTT 
Glu Leu He Arg Asn Asn Arg Val 
260 

TAC ACC ACC CAA TAT TAC AGC CTT 
Tyr Thr Thr Gin Tyr Tyr Ser Leu 

275 280 



TTC AAT AAG AAA 
Phe Asn Lys Lys 
250 

ACA AAC GCT ACT 
Thr Asn Ala Thr 

265 

GAG GCT CAA GTA 
Glu Ala Gin Val 



AAA TTG ACG GGT 
Lys Leu, Thr Gly 
\255 

\ 

GCT AAC GMT AAA 
Ala Asn Asj> Lys 
270 



\ 



ACA GGC AAC CGC 
Thr Gly Asn Arg 

285 



720 



768 



816 



864 



TTC AAC GGC AAG GCA ACC CCA ACC GAC AAA CCT CGC ACT CGA GAA ACC^ 
Phe Asn Gly Lys Ala Thr Ala Thr Asp Lys Pro Gly Thr Cly Glu Thr 
290 295 300 



912 



AAA \CAA CAT CCC TTT GTT TCC GAC TCC TCT TCT TTG AGC GGC CCC TTT 
Lys Gin His Pro Phe Val Scr Asp Ser Ser Ser Leu Ser Gly Gly Phe 
305 \ 310 315 320 

TTC GGc\cG AAG GGT GAG GAA TTG GGT TTC CGC TTT TTG AGC AAC GAT 
Phe Gly Pro Lys Gly Glu Glu Leu Gly Phe Arg Phe Leu Ser Asn Asp 
* 325 330 335 

CAA AAA GTTV;CC GTT GTC GGC AGC GCG AAA ACC CAA GAC AAA GCC CCA 
Gin Lys Val Ala Val Val Gly Ser Ala Lys Thr Gin Asp Lys Ala Ala 

345 350 

AAT GGC AAT ACT\gCG GCG GCT TCA GGC GGC ACA GAT GCG CCA GCA TCA 
Asn Gly Asn Thr Ala Ala Ala Ser Gly Gly Thr Asp Ala Ala Ala Ser 

355 \ 360 365 



960 



1008 



1056 



1X04 



AAC GGT GCG GCA GGC> 
Asn Gly Ala Ala Gly 
370 



TCG TCT GAA AAC ACT AAG CTG ACC ACG GTT 
jr Ser Ser Glu Asn Ser Lys Leu Thr Thr Val 

375 380 



1152 



TTG GAT GCG GTT GAA TTG 

Leu Asp Ala Val Glu Leu Tfl 
385 390 

GAC AAC TTC AGC AAT GCC GCC 
Asp Asn Phe Ser Asn Ala Ala 
405 



CTA AAC GAC AAG AAA ATC AAA AAT CTC 
Leu Asn Asp Lys Lys He Lye Asn Leu 
395 400 

CTG GTT GTC GAC GGC ATT ATC ATT 
Leu Val Val Asp Gly He Met He 
410 415 



1200 



1248 



CCG CTC CTG CCC GAG ACT TCC GAA 
Pro Leu Leu Pro Glu Thr Ser Glu Sei 
420 42S 



AGC AAT GAG GCA GAT AAA 
Ser Asn Gin Ala Asp Lys 
430 



1296 



GGT AAA AAA GGT AAA AAC GGT AAA AAC GG& GGA ACA GAC TTT ACC TAC 
Gly Lys Lys Gly Lys Asn Gly Lys Asn GlyXciy Thr Asp Phe Thr Tyr 
435 440 \ 445 



1344 



AAA ACA ACC TAC ACG CCG AAA AAC GAT GAC AAXs 
Lys Thr Thr Tyr Thr Pro Lys Asn Asp Asp Lys 
450 455 



GAT 



ACC AAA GCC CAA 
Thr Lys Ala Gin 



1392 



ACA GGT GCG GCA GGC TCT AGC GGC GCA CAA ACC GAT^TTG GGT AAG GCG 
Thr Gly Ala Ala Gly Ser Ser Gly Ala Gin Thr Asp Leu Gly Lys Ala 
465 470 475 \ 400 



1440 



GAC GTT AAC GGC GGT AAG GCA GAA ACA AAA ACC TAT GAA toC GAA GTC 
Asp Val Asn Gly Gly Lys Ala Glu Thr Lys Thr Tyr Glu Val Glu Val 
485 490 \ 495 

TGC TGT TCC AAC CTC AAT TAT CTG AAA TAC GGA ATG TTG ACG feGT AAA 
Cya Cys Ser Asn Leu Asn Tyr Leu Lys Tyr Gly Met Leu Thr Ar£ Lys 
500 505 510 

AAC AGC AAG TCC GCG ATG GAG GCA GGA GGA AAC AGT ACT CAA GCT 
Asn Ser Lys Ser Ala Met Gin Ala Gly Gly Asn Ser Scr Gin Ala 
515 520 525 

GCT AAA ACG GAA CAA GTT GAA CAA AGT ATG TTC CTC CAA GGC GAG CGT 
Ala Lys Thr Glu Gin Val Glu Gin Ser Met Phe Leu Gin Gly Glu Arg 
530 535 540 

ACC GAT GAA AAA GAG ATT CCA AAC GAC CAA AAC GTC GTT TAT CGG GGG 
Thr Asp Glu Lys Glu He Pro Asn Asp Gin Asn Val Val Tyr Arg Gly 
545 550 555 560 



1468 



1536 



1584 



1632 
TCT TGG TAC GGG CAT ATT GCC ACC AGC ACA AGC TGG AGC GGC AAT CCT
Ser Trp Tyr Gly His Ile Ala Ser Ser Thr Ser Trp Ser Gly Asn Ala
565 570 575 

TCC AAtNcCA ACG ACT CGC AAC AGG GCG GAA TTT ACT GTG AAT TTC GAT 
Ser Asn Ala Thr Ser Gly Asn Arg Ala Clu Phe Thr Val Asn Phe Asp 
580 585 590 

ACG AAA AAA\ATT AAC GGC ACG TTA ACC GCT GAA AAC AGG CMS GAG GCA 
Thr Lys Lys lie Asn Gly Thr Leu Thr Ale Clu Asn Arg Gin Glu Ale 

595 \ 600 605 

ACC TTT ACC AT* GAT GGT AAG ATT GAG GGC AAC GGT TTT TCC GGT ACG 
Thr Phe Thr Ile\Asp Gly Lys lie Glu Cly Asn Gly Phe Ser Gly Thr 
610 \ 615 620 



GAG 



GCA AAA ACT GCT GAfc TTA GGT TTT GAT CTC GAT CAA AGC AAT ACC ACC 
Ale Lys Thr Ale Aap\Leu Gly Phe Asp Leu Asp Gin Ser Asn Thr Thr 
625 630 635 640 



GCC ACG CCT AAG GCA TAl\ ATC ACA GAT GCC AAG GTG GAG GGC CCT TTT 
Gly Thr Pro Lys Ale TyrXlle Thr Asp Ale Lys Vel Gin Gly Gly Phe 

645 \ 650 655 

TAC GGG CCT AAA GCC GAA GAG TTG GGC CGA TGG TTT GCC TAT CCG GGC 
Tyr Gly Pro Lys Ale Glu Glu>Leu Gly Gly Trp Phe Ale Tyr Pro Gly 
660 \ _665 670 



GAT AAA CAA ACG GAA AAG GCA 
Asp Lys Gin Thr Glu Lys Ale 

675 




GTT GCA TCC GGC GAT GGA AAT TCA 
ml Ale Ser Gly Asp Gly Asn ser 
685 



GCA AGC AGC GCG ACC GTG GTA TTC GGT GCG AAA CGC CAA GAG CCT GTG 
Ale Ser Ser Ale Thr Vel Vel Phe GlyVAla I** Arg Gin Gin Pro Vel 

690 695 \ 700 



CAA TAACXAAAXG AA ft TI G ICTG GGTGGCGGCG GCACGTTCGA TCTTGATTCT 
Gin 
705 

GTCGATACCG AAGCCCCGCG TCCCGCCCCA AAATATCAAG ATGTTTCTTC CGAAAAACCG 
CAAGCCCAAA AAGACCAAGG CGGATACGGT 
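
Because each nucleotide line in this listing is printed above its translation, a codon-by-codon check is a simple way to spot transcription slips where a codon and the residue beneath it disagree. The sketch below is a minimal Python illustration. It assumes the standard genetic code, which the listing does not state explicitly; the names `CODON_TABLE`, `ONE_TO_THREE` and `translate` are hypothetical. The test string is the first printed line of SEQ ID NO: 5.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering (assumed here).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Three-letter codes as printed in the listing; "***" marks a stop codon.
ONE_TO_THREE = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "***",
}

def translate(dna: str) -> list[str]:
    """Translate a DNA string (spaces ignored) into three-letter residue codes."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return [ONE_TO_THREE[CODON_TABLE[seq[i:i + 3]]] for i in range(0, usable, 3)]

# First printed line of SEQ ID NO: 5 and the residues printed beneath it.
first_line = "TGT CTG GGT GGC GGC GGC ACG TTC GAT CTT GAT TCT GTC GAT"
printed = "Cys Leu Gly Gly Gly Gly Thr Phe Asp Leu Asp Ser Val Asp".split()
matches = translate(first_line) == printed
```

A line for which `matches` is false would point to a misread base or residue rather than indicate which of the two is wrong; the paired protein listing (here SEQ ID NO: 6) can serve as a second witness.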

(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 705 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:







Cys Leu Gly Gly Gly Gly Thr Phe Asp Leu Asp Ser Val Asp Thr Glu
1 5 10 15

Ala Pro Arg Pro Ala Pro Lys Tyr Gln Asp Val Ser Ser Glu Lys Pro
20 25 30

Gln Ala Gln Lys Asp Gln Gly Gly Tyr Gly Phe Ala Met Arg Leu Lys
35 40 45
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Arg Arg Aan Trp His Pro Gin Ala Asn Pro Lys Glu Asp Glu He Lys 
50 \ 55 60 

Leu Ser Gl\ Asn Asp Trp Glu Ale Thr Gly Leu Pro Gly Asn Pro Lys 
65 \ 70 75 80 

Asn Leu Pro &Lu Arg Gin Lys Ser Val He Glu Lys Val Lys Thr Gly 
65 90 95 

Ser Asp Ser AsnXlle Tyr Ser Ser Pro Tyr Leu Thr Gin Ser Asn His 
100 \ 105 110 

Gin Asn Gly Ser A$a Asn Gin Pro Lys Asn Glu Val Lys Asp Tyr Lys 
115 \ 120 125 

Glu Phe Lys Tyr Val Vyr Ser Gly Trp Phe Tyr Lys His Ale Lys Leu 
130 \ 135 140 

Glu He He Lys Glu As \ Asn Leu He Lys Gly Ala Lys Ser Gly Asp 
145 150\ 155 * 160 

Asp Gly Tyr He Phe Tyr H*is Gly Glu Lys Pro Ser Arg Gin Leu Pro 
165 \ 170 175 



Val Ser Gly Glu Val Thr 
180 



Lys Gly Val Trp His Phe Val Thr Asp 
185 190 



Thr Lys Gln Gly Gln Lys Phe Asn Asp Ile Leu Gly Thr Ser Lys Lys 
195 200 205 

Gln Gly Asp Arg Tyr Ser Gly Phe Pro Gly Asp Asp Gly Glu Glu Tyr 
210 215 220 

Ser Asn Lys Asn Glu Ala Thr Leu Gln Gly Ser Gln Glu Gly Tyr Gly 
225 230 235 240 

Phe Thr Ser Asn Leu Lys Val Asp Phe Asn Lys Lys Lys Leu Thr Gly 
245 250 255 

Glu Leu Ile Arg Asn Asn Arg Val Thr Asn Ala Thr Ala Asn Asp Lys 
260 265 270 

Tyr Thr Thr Gln Tyr Tyr Ser Leu Glu Ala Gln Val Thr Gly Asn Arg 
275 280 285 

Phe Asn Gly Lys Ala Thr Ala Thr Asp Lys Pro Gly Thr Gly Glu Thr 
290 295 300 

Lys Gln His Pro Phe Val Ser Asp Ser Ser Ser Leu Ser Gly Gly Phe 
305 310 315 320 

Phe Gly Pro Lys Gly Glu Glu Leu Gly Phe Arg Phe Leu Ser Asn Asp 
325 330 335 

Gln Lys Val Ala Val Val Gly Ser Ala Lys Thr Gln Asp Lys Ala Ala 
340 345 350 

Asn Gly Asn Thr Ala Ala Ala Ser Gly Gly Thr Asp Ala Ala Ala Ser 
355 360 365 

Asn Gly Ala Ala Gly Thr Ser Ser Glu Asn Ser Lys Leu Thr Thr Val 
370 375 380 

Leu Asp Ala Val Glu Leu Thr Leu Asn Asp Lys Lys Ile Lys Asn Leu 
385 390 395 400 

Asp Asn Phe Ser Asn Ala Ala Gln Leu Val Val Asp Gly Ile Met Ile 
405 410 415 

Pro Leu Leu Pro Glu Thr Ser Glu Ser Gly Ser Asn Gln Ala Asp Lys 
420 425 430 

Gly Lys Lys Gly Lys Asn Gly Lys Asn Gly Gly Thr Asp Phe Thr Tyr 
435 440 445 

Lys Thr Thr Tyr Thr Pro Lys Asn Asp Asp Lys Asp Thr Lys Ala Gln 
450 455 460 

Thr Gly Ala Ala Gly Ser Ser Gly Ala Gln Thr Asp Leu Gly Lys Ala 
465 470 475 480 

Asp Val Asn Gly Gly Lys Ala Glu Thr Lys Thr Tyr Glu Val Glu Val 
485 490 495 

Cys Cys Ser Asn Leu Asn Tyr Leu Lys Tyr Gly Met Leu Thr Arg Lys 
500 505 510 

Asn Ser Lys Ser Ala Met Gln Ala Gly Gly Asn Ser Ser Gln Ala Asp 
515 520 525 

Ala Lys Thr Glu Gln Val Glu Gln Ser Met Phe Leu Gln Gly Glu Arg 
530 535 540 

Thr Asp Glu Lys Glu Ile Pro Asn Asp Gln Asn Val Val Tyr Arg Gly 
545 550 555 560 

Ser Trp Tyr Gly His Ile Ala Ser Ser Thr Ser Trp Ser Gly Asn Ala 
565 570 575 

Ser Asn Ala Thr Ser Gly Asn Arg Ala Glu Phe Thr Val Asn Phe Asp 
580 585 590 

Thr Lys Lys Ile Asn Gly Thr Leu Thr Ala Glu Asn Arg Gln Glu Ala 
595 600 605 

Thr Phe Thr Ile Asp Gly Lys Ile Glu Gly Asn Gly Phe Ser Gly Thr 
610 615 620 

Ala Lys Thr Ala Asp Leu Gly Phe Asp Leu Asp Gln Ser Asn Thr Thr 
625 630 635 640 

Gly Thr Pro Lys Ala Tyr Ile Thr Asp Ala Lys Val Gln Gly Gly Phe 
645 650 655 

Tyr Gly Pro Lys Ala Glu Glu Leu Gly Gly Trp Phe Ala Tyr Pro Gly 
660 665 670 

Asp Lys Gln Thr Glu Lys Ala Thr Val Ala Ser Gly Asp Gly Asn Ser 
675 680 685 

Ala Ser Ser Ala Thr Val Val Phe Gly Ala Lys Arg Gln Gln Pro Val 
690 695 700 

Gln 
705 



WO 95/33049 - 54 - PCT/FR95/00701 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2114 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: N. meningitidis 

(B) STRAIN: 6940 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 1..2079 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..2079 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 



TGT TTG CGT CCC CCC G6C ACC 
cys Leu Gly Gly Gly Gly Thr 
1 5 



✓CAT CTT GAT TCT CTC GAT ACC GAA 
Leu Asp Ser Val Asp Thr Glu 
10 15 



CCC CCG CGT CCC GAC CCA AAG TAT CAA GAT GTT TCT TCC GAA AAA CCG 
Ala Pro Arg Pro Asp Pro Lys Tyr Gln Asp Val Ser Ser Glu Lys Pro 
20 25 30 



CAA GCC CAA AAA GAC CAA GCC CGA TAC 
Gin Ala Gin Lys Asp Gin Gly Gly Tyr 
35 40 



CGG AGG AAT TGG TAT TCC CCA GCA AAA GAA 
Arg Arg Asn Trp Tyr Ser Ala Ala Lys Glu 

50 55 



TTT GCG ATG AGG TTG AAA 
Gly Phe Ala Met Arg Leu Lys 
45 

GAG GTT AAA CTG AAC 
Glu Val Lys Leu Asn 
60 



GAG AGT GAT TGG GAG ACC ACA GGA TTG CCG ACA 
Glu Ser Asp Trp Glu Thr Thr Gly Leu Pro Thr Glu 
65 70 75 

CCA TTA AAA CAA GAA TCC GTC ATT TCA AAA GTA CAA 
Pro Leu Lys Gin Glu Ser Val lie Ser Lys Val Gin 
85 90 



CCC AAG AAA CTG 
Pro Lys Lys Leu 
80 

AAC AAT'GGC 
Ala Asn Asn Gly 
95 



CAT CAA 
His Gin 



GAC AAC AAT ATT TAC ACT TCC CCC TAT CTC ACG CAA TCA 
Asp Asn Asn He Tyr Thr Ser Pro Tyr Leu Thr Gin Ser 
100 105 

AAT AGC AGC ATT AAT GCC GGT GCA AAC CTG CCA AAA AAC GAA GTA ACA 
Asn Ser Ser Ile Asn Gly Gly Ala Asn Leu Pro Lys Asn Glu Val Thr 
115 120 125 

AAT TAT AAA GAT TTC AAA TAT CTT TAT TCC GGC TGG TTT TAT 
Asn Tyr Lys Asp Phe Lys Tyr Val Tyr Ser Gly Trp Phe Tyr 
130 135 140 

GCT AAA AAC GAA ATC ATA AGA GAA AAC AGC TCA ATT AAG GGT GCA> 
Ala Lys Asn Glu He He Arg Glu Asn Ser Ser He Lys Gly Ala 
145 150 155 



\eo 



48 



96 



144 



192 



240 



288 



336 



384 



432 



480 
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AAC CCC CAC^GAC CCC TAT ATC TTT TAT CAC GGC AAA GAA CCT TCC CGA 
Asn Gly Asp Asp Gly Tyr Ile Phe Tyr His Gly Lys Glu Pro Ser Arg 
165 170 175 

GAA CTT CCC CCT TCT GGA ACA GTT ACC TAT AAA GGT GTG TGG CAT TTT 
Gln Leu Pro Ala Ser Gly Thr Val Thr Tyr Lys Gly Val Trp His Phe 
180 185 190 



GCG ACC GAT 
Ala Thr Asp 
195 

TCG AAA AAA 
Ser Lys Lys 
210 

GAA CAA TAT 
Glu Gin Tyr 
225 



\ 

GTC AAA AAA TCC CAA 
Val Lys Lys Ser Gin 
200 



\ 



CAA GGC GAC AGG TAT 
Gin Gly Asp\Arg Tyr 
215 

TCT AAT AAA AAC GAA 
Ser Asn Lys Asn^Glu 



230 



\ 



AAT TTT CGC GAT ATT ATC GAG CCT 
Asn Phe Arg Asp lie lie Gin Pro 
205 

AGC GGA TTT TCG GGC GAT GAT GAT 
Ser Gly Phe Ser Gly Asp Asp Asp 
220 

TCC ATG CTG AAA GAT GGT CAA GAG 
Ser Met Leu Lys Asp Gly Gin Glu 
235 240 



GGT TAT GGT TTT ACC TCG AAT TTA GAA GTG GAT TTC GGC ACT AAA AAA 
Gly Tyr Gly Phe Thr Ser Asn Leu Glu Val Asp Phe Gly Ser Lys Lys 
245 \ 250 255 



528 



576 



624 



672 



720 



768 



TTG ACG GGT AAA TTA ATA 
Leu Thr Gly Lys Leu Ile 
260 



CGC AAT AAT AGA GTT ACA AAC GCT CCT ACT 
Arg Asn Asn Arg Val Thr Asn Ala Pro Thr 

265 270 



AAC GAT AAA TAC ACC ACC GAA TAC TAC AGC CTT GAT GCC CAA ATA ACA 
Asn Asp Lys Tyr Thr Thr Gin Tyr Tyr Ser Leu Asp Ala Gin lie Thr 
275 280 285 



816 



864 



GGC AAC CGC TTC AAC GGT AAG GCG ATA CGG ACC GAC AAA CCC GAC ACT 
Gly Asn Arg Phe Asn Gly Lys Ala Zle Arg Thr Asp Lys Pro Asp Thr 
290 295 300 



912 



GGA GGA ACC AAA CTA CAT CCC TTT GTT TCC GAC TCG TCT TCT TTG AGC 
Gly Gly Thr Lys Leu His Pro Phe Val Ser Asp Ser Ser Ser Leu Ser 
305 310 315 320 



\ 



GGC GGC TTT TTC GGT CCG AAG GGT GAG GAA TTG GGT TTC CGC TTT TTG 
Gly Gly Phe Phe Gly Pro Lys Gly Glu Glu Leu Gly Phe Arg Phe Leu 
325 330 \ 335 



AGC GAC GAT AAA AAA GTT GCG GTT GTC GGC AGC 
Ser Asp Asp Lys Lys Val Ala Val Val Gly Ser 
340 345 



GCG AAA ACC AAA GAC 
Ala Lys Thr Lys Asp 
350 



960 



100B 



1056 



AAA ACG GAA AAT GGC GCG GTG GCT 
Lys Thr Glu Asn Gly Ala Val Ala 
355 360 

TCA AAC GGT GCG GCA GGC ACG TCG 
Ser Asn Gly Ala Ala Gly Thr Ser 

370 375 



TCA GGC GGC 
Ser Gly Gly 



TCT GAA AAC 
Ser Glu Asn 



ACA GAT GCG GCA CCA 
Thr Asp Ala Ala Ala 
365 



ACT AAG 
Ser Lys 
380 



\ 



CTG ACC ACG 
Leu Thr Thr 



1104 



1152 



GTT TTG GAT GCG GTC GAG CTG AAA TTG GGC GAT AAG GAA GTC CAA\ AAG 
Val Leu Asp Ala Val Glu Leu Lys Leu Gly Asp Lys Glu Val GlnYLys 
385 390 395 400 



1200 



CTC GAC AAC TTC AGC AAC CCC GCC CAA CTG GTT GTC GAC GCC ATT ATG 
Leu Asp Asn Phe Ser Asn Ala Ala Gin Leu Val Val Asp Gly lie Met 
405 410 415 



1246 
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ATT COG CTC TTC CCC GAG GCT TCC GAA ACT GGG AAC AAT CAA GCC AAT 
Ile Pro Leu Leu Pro Glu Ala Ser Glu Ser Gly Asn Asn Gln Ala Asn 
420 425 430 

CAA GGT jPlCA AAT GGC GGA ACA GCC TTT ACC CGC AAA TTT GAC CAC ACG 
Gln Gly Thr Asn Gly Gly Thr Ala Phe Thr Arg Lys Phe Asp His Thr 
435 440 445 

CCG GAA ACl\ GAT AAA AAA GAC GCC CAA GCA GGT ACG CAG ACG AAT GGG 

Pro Glu Ser Asp Lys Lys Asp Ala Gln Ala Gly Thr Gln Thr Asn Gly 

450 455 460 

GCG CAA ACC 
Ala Gln Thr 
465 



TCA AAT ACG GCA GGT CAT ACC AAT GGC AAA ACA AAA 
Ala Ser Asn Thr Ala Gly Asp Thr Asn Gly Lys Thr Lys 
470 475 480 



1296 



1344 



1392 



1440 



ACC TAT GAA GTC GAA GTC TGC TGT TCC AAC CTC AAT TAT CTG AAA TAC 
Thr Tyr Glu Val GluXval Cys Cys Ser Asn Leu Asn Tyr Leu Lys Tyr 
485 \ 490 495 



1468 



GGA ATG TTC ACG CCC 
Gly Met Leu Thx Arg 
500 

ACC ACT AGT CAA GCT GAT 
Ser Ser Ser Gin Ala Asp 

515 



AAC ACC AAG TCC GCG ATG CAG GCA GGA GAA 
\Asn Ser Lys Ser Ala Met Gin Ala Gly Glu 

505 510 



AAA ACG GAA CAA GTT GAA CAA AGT ATG 
Ala\Lys Thx Glu Gin Val Glu Gin Ser Met 

525 



TTC CTC CAA GGC GAG CGC ACCy 
Phe Leu Gin Gly Glu Arg Thr 
530 535 



GAA AAA GAG ATT CCA AGC GAG CAA 
Glu Lys Glu lie Pro Ser Glu Gin 
540 



1536 



1584 



1632 



AAC ATC GTT TAX CGG GGG TCT 
Asn lie Val Tyr Arg Gly Ser 

545 550 

AGC ACA AGC TGG AGC GCC AAT 
Ser Thr Ser Trp Ser Gly Asn 

565 

GCG GAA TTT ACT CTG AAT TTT 
Ala Glu Phe Thr Val Asn Phe 
580 

ACC GCT GAC AAC AGG CAG GAG 
Thr Ala Asp Asn Arg Gin Glu 

595 



TGG TAC GGA TAT ATT GCC 
Trp Tyr Gly Tyr lie Ala 

GCT TCC AAT GCA ACG AGT 
Ala Ser Asn Ala Thr Ser 

GCC GAT AAA AAA ATT ACT 
Ala Asp Lys Lys He Thr 

585 \ 

CCA ACC TTT ACC ATT GAT 
Ala Thr Phe Thr lie Asp 

600 \ 605 



AAG GAC AAC GGC TTT GAA GGT ACG GCG AAA ACT GCT 
Lys Asp Asn Gly Phe Glu Gly Thr Ala Lys Thr Ala 
610 615 620 




AAC GAC AAA 
Asn Asp Lys 

560 

GGC AAC ACG 
Gly Asn Arg 
575 

GGT ACG TTA 
Gly Thr Leu 
590 

GGT AAT ATT 
Gly Asn Zle 



TCA GGT TTT 
u Ser Gly Phe 



GAT CTC GAT CAA AGC AAT ACC ACC CGC ACG CCT AAG GCA TAT ATC ACA 
Asp Leu Asp Gln Ser Asn Thr Thr Arg Thr Pro Lys Ala Tyr Ile Thr 

625 630 635 640 

GAT GCC AAG CTG CAG GGC GGT TTT TAC GGG CCC AAA GCC GAA\GAG TTG 
Asp Ala Lys Val Gin Gly Gly Phe Tyr Gly Pro Lys Ala Glu Glu Leu 
645 650 655 



1680 



1728 



1776 



1824 



1872 



1920 



1968 



GGC GGA TGG TTT GCC TAT CCG GGC GAT AAA CAA ACG AAA AAT GCA ACA 
Gly Gly Trp Phe Ala Tyr Pro Gly Asp Lys Gin Thr Lys Asn Ala\Thr 

660 665 670 \ 



2016 
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AAT GCA TtiC GGC AAT AGC AGT CCA ACT CTC CTA TTC CGT CCC AAA CCC 2064 
Asn Ala Ser Gly Asn Ser Ser Ala Thr Val Val Phe Gly Ala Lys Arg 
675 680 685 

CAA CAG CCT GTG CGA TAACGCAAGC CCAAAAAGAC CAAGGCGGAT ACGGT 2114 
Gln Gln Pro Val Arg 
690 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 693 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Cys Leu Gly Gly Gly Gly Thr Phe Asp Leu Asp Ser Val Asp Thr Glu 
1 5 10 15 

Ala Pro Arg Pro Asp Pro Lys Tyr Gln Asp Val Ser Ser Glu Lys Pro 
20 25 30 

Gln Ala Gln Lys Asp Gln Gly Gly Tyr Gly Phe Ala Met Arg Leu Lys 
35 40 45 

Arg Arg Asn Trp Tyr Ser Ala Ala Lys Glu Asp Glu Val Lys Leu Asn 
50 55 60 

Glu Ser Asp Trp Glu Thr Thr Gly Leu Pro Thr Glu Pro Lys Lys Leu 
65 70 75 80 

Pro Leu Lys Gln Glu Ser Val Ile Ser Lys Val Gln Ala Asn Asn Gly 
85 90 95 

Asp Asn Asn Ile Tyr Thr Ser Pro Tyr Leu Thr Gln Ser Asn His Gln 
100 105 110 

Asn Ser Ser Ile Asn Gly Gly Ala Asn Leu Pro Lys Asn Glu Val Thr 
115 120 125 

Asn Tyr Lys Asp Phe Lys Tyr Val Tyr Ser Gly Trp Phe Tyr Lys His 
130 135 140 

Ala Lys Asn Glu Ile Ile Arg Glu Asn Ser Ser Ile Lys Gly Ala Lys 
145 150 155 160 

Asn Gly Asp Asp Gly Tyr Ile Phe Tyr His Gly Lys Glu Pro Ser Arg 
165 170 175 

Gln Leu Pro Ala Ser Gly Thr Val Thr Tyr Lys Gly Val Trp His Phe 
180 185 190 

Ala Thr Asp Val Lys Lys Ser Gln Asn Phe Arg Asp Ile Ile Gln Pro 
195 200 205 

Ser Lys Lys Gln Gly Asp Arg Tyr Ser Gly Phe Ser Gly Asn Asp Asp 
210 215 220 

Glu Gln Tyr Ser Asn Lys Asn Glu Ser Met Leu Lys Asp Gly Gln Glu 
225 230 235 240 

Gly Tyr Gly Phe Thr Ser Asn Leu Glu Val Asp Phe Gly Ser Lys Lys 
245 250 255 

Leu Thr Gly Lys Leu Ile Arg Asn Asn Arg Val Thr Asn Ala Pro Thr 
260 265 270 

Asn Asp Lys Tyr Thr Thr Gln Tyr Tyr Ser Leu Asp Ala Gln Ile Thr 
275 280 285 

Gly Asn Arg Phe Asn Gly Lys Ala Ile Arg Thr Asp Lys Pro Asp Thr 
290 295 300 

Gly Gly Thr Lys Leu His Pro Phe Val Ser Asp Ser Ser Ser Leu Ser 
305 310 315 320 

Gly Gly Phe Phe Gly Pro Lys Gly Glu Glu Leu Gly Phe Arg Phe Leu 
325 330 335 

Ser Asp Asp Lys Lys Val Ala Val Val Gly Ser Ala Lys Thr Lys Asp 
340 345 350 

Lys Thr Glu Asn Gly Ala Val Ala Ser Gly Gly Thr Asp Ala Ala Ala 
355 360 365 

Ser Asn Gly Ala Ala Gly Thr Ser Ser Glu Asn Ser Lys Leu Thr Thr 
370 375 380 

Val Leu Asp Ala Val Glu Leu Lys Leu Gly Asp Lys Glu Val Gln Lys 
385 390 395 400 

Leu Asp Asn Phe Ser Asn Ala Ala Gln Leu Val Val Asp Gly Ile Met 
405 410 415 

Ile Pro Leu Leu Pro Glu Ala Ser Glu Ser Gly Asn Asn Gln Ala Asn 
420 425 430 

Gln Gly Thr Asn Gly Gly Thr Ala Phe Thr Arg Lys Phe Asp His Thr 
435 440 445 

Pro Glu Ser Asp Lys Lys Asp Ala Gln Ala Gly Thr Gln Thr Asn Gly 
450 455 460 

Ala Gln Thr Ala Ser Asn Thr Ala Gly Asp Thr Asn Gly Lys Thr Lys 
465 470 475 480 

Thr Tyr Glu Val Glu Val Cys Cys Ser Asn Leu Asn Tyr Leu Lys Tyr 
485 490 495 

Gly Met Leu Thr Arg Lys Asn Ser Lys Ser Ala Met Gln Ala Gly Glu 
500 505 510 

Ser Ser Ser Gln Ala Asp Ala Lys Thr Glu Gln Val Glu Gln Ser Met 
515 520 525 

Phe Leu Gln Gly Glu Arg Thr Asp Glu Lys Glu Ile Pro Ser Glu Gln 
530 535 540 

Asn Ile Val Tyr Arg Gly Ser Trp Tyr Gly Tyr Ile Ala Asn Asp Lys 
545 550 555 560 

Ser Thr Ser Trp Ser Gly Asn Ala Ser Asn Ala Thr Ser Gly Asn Arg 
565 570 575 

Ala Glu Phe Thr Val Asn Phe Ala Asp Lys Lys Ile Thr Gly Thr Leu 
580 585 590 

Thr Ala Asp Asn Arg Gln Glu Ala Thr Phe Thr Ile Asp Gly Asn Ile 
595 600 605 

Lys Asp Asn Gly Phe Glu Gly Thr Ala Lys Thr Ala Glu Ser Gly Phe 
610 615 620 

Asp Leu Asp Gln Ser Asn Thr Thr Arg Thr Pro Lys Ala Tyr Ile Thr 
625 630 635 640 

Asp Ala Lys Val Gln Gly Gly Phe Tyr Gly Pro Lys Ala Glu Glu Leu 
645 650 655 

Gly Gly Trp Phe Ala Tyr Pro Gly Asp Lys Gln Thr Lys Asn Ala Thr 
660 665 670 

Asn Ala Ser Gly Asn Ser Ser Ala Thr Val Val Phe Gly Ala Lys Arg 
675 680 685 

Gln Gln Pro Val Arg 
690 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2114 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: N. meningitidis 

(B) STRAIN: S3032 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 1..2097 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..2097 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

TGT TTG GGC GGA GGC GGC GGC ACT TTC GAT CTT GAT TCT GT& GAT ACC 48 
Cys Leu Gly Gly Gly Gly Gly Ser Phe Asp Leu Asp Ser Val Asp Thr 
1 5 10 15 

GAA GCC CCG CGT CCC GCG CCA AAG TAT CAA GAT CTT TCT TCC OLA AAA 96 
Glu Ala Pro Arg Pro Ala Pro Lys Tyr Gln Asp Val Ser Ser Glu Lys 
20 25 30 

CCG CAA GCC CAA AAA GAC CAA GGC GGA TAC GGT TTT GCG ATG AGG >TTC 144 
Pro Gln Ala Gln Lys Asp Gln Gly Gly Tyr Gly Phe Ala Met Arg Leu 
35 40 45 



AAA CGG AGG AAT TCC TAT CCG TCG GGA AAA GAA AAC GAG GTT AAA CT 
Lys Arg Arg Asn Trp Tyr Pro Ser Ala Lys Glu Asn Glu Val Lys Leu 
50 55 60 
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AAT GAG V AGT GAT TCG GAG ACG ACA CGA TTG CCA AGC AAT CCC AAA AAC 
Asn Glu Ser Asp Trp Glu Thr Thr Gly Leu Pro Ser Asn Pro Lys Asn 
65 70 75 80 



TTA CCT GA< 
Leu Pro Glu 



CAG AAA TCG GTT ATT GAT CAA G7A GAA ACA GAT GGC 
Arg Gln Lys Ser Val Ile Asp Gln Val Glu Thr Asp Gly 
85 90 95 



GAC AGC AAT AAfc AGC AAT ATT TAT TCT TCC CCC TAT CTC ACG CAA TCA 
Asp Ser Asn Asn Ser Asn Ile Tyr Ser Ser Pro Tyr Leu Thr Gln Ser 
100 105 110 



AAC CAT CAA AAC 
Asn His Gin Asn 

115 



AAC ACT GGC AAC GGT GTA AAC CAA CCA AAA AAC 
Asn Thr Gly Asn Gly Vel Asn Gin Pro Lys Asn 

120 125 




GAA GTA ACA GAT TAC 
Glu Val Thr Asp Tyr 
130 



AAT TTT AAA TAT GTT TAT TCC GGC TGG TTT 
Asn Phe Lys Tyr Val Tyr Ser Gly Trp Phe 
140 _ 



TAC AAA CAC GCC AAA CGA 
Tyr Lys His Ala Lys Arg 
145 150 

GCA AAA AAC GGC GAC GAC GGT 
Ala Lys Asn Gly Asp Asp 
165 



GTT AAC TTA GCG GTG GAA CCT AAA ATT 
Vel Asn Leu Ala val Glu Pro Lys lie 
155 160 



ATC TTC TAT CAC GGT AAA GAC CCT 
He Phe Tyr His Gly Lys Asp Pro 
170 175 



TCC CGA CAA CTT CCC CCT TCT GGA AAA ATT ACC TAT AAA GGT GTG TCG 
Ser Arg Gln Leu Pro Ala Ser Gly Lys Ile Thr Tyr Lys Gly Val Trp 
180 185 190 



r 



CAT TTT GCG ACC GAT ACA AAA AGG GGT X CAA AAA TTT CGT GAA ATT ATC 
His Phe Ala Thr Asp Thr Lys Arg Gly Gln Lys Phe Arg Glu Ile Ile 
195 200 \ 



240 



288 



336 



364 



432 



480 



528 



576 



624 



205 



CAA CCT TCA AAA AAT CAA GGC GAC AGA TATNAGC GGA TTT TCG CGT GAT 
Gin Pro Ser Lys Asn Gin Gly Asp Arg Tyr Ser Gly Phe Ser Gly Asp 
210 215 \ 220 



672 



GAT GAT GAA CAA TAT TCT AAT AAA AAC GAA TCC\ATG CTG AAA GAT GGT 
Asp Asp Glu Gin Tyr Ser Asn Lys Asn Glu Ser Met Leu Lys Asp Gly 
225 230 235 \ 240 



720 



CAT GAA GGT TAT GGT TTT GCC TCG AAT TTA GAA GTG\GAT TTC GAC AAT 
His Glu Gly Tyr Gly Phe Ala Ser Asn Leu Glu Vel Asp Phe Asp Asn 
245 250 \ 255 



768 



AAA AAA TTG ACG GGT AAA TTA ATA CGC AAT AAT GCG AAC\ CAA AAT AAT 
Lys Lys Leu Thr Gly Lys Leu He Arg Asn Asn Ala Asn Gin Asn Asn 
260 265 2^0 

AAT ACT AAT AAT GAC AAA CAC ACC ACC CAA TAC TAC AGC CTT GAT GCG 
Asn Thr Asn Asn Asp Lys His Thr Thr Gin Tyr Tyr Ser LeuYAsp Ala 
275 280 285 \ 



816 



864 



ACC CTT AAG GGA AAC CGC TTC AGC GGA AAA GCG GAA GCA ACC GAC AAA 
Thr Leu Lys Gly Asn Arg Phe Ser Gly Lys Ala Glu Ala Thr Asp Lys 



290 



295 



300 



CCC AAA AAC GAC GGC GAA ACC AAG 
Pro Lys Asn Asp Gly Glu Thr Lys 
305 310 



GAA CAT 
Glu His 



CCC 
Pro 
315 



TTT GTT TCC 
Phe Val Ser Asp 



\ 



GAC TCG 
Ser 



320 



912 



960 
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TCT 
Ser 



tct rrc 

Ser Leu 



AGC CCC CCC TTT TTC 
Scr Gly Gly Phe Phe 
325 



TTC CGC TTT 
Phe Arg Pfae 



rG AGC AAC GAT CAA AAA GTT GCC GTT GTC GGC AGC GCG 
Ser Asn Asp Gin Lys Val Ala Val Val Gly Ser Ala 
345 350 



AAA ACC AAA GAC 
Lys Thr Lys Asp 

355 



GGC ACA GAT GCG GCA 
Gly Thr Asp Ala Ala 
370 



GGC ACA GCA ATC 
Gly Thr Ala He 




GGC CCG CAG 
Gly Pro Gin 
330 



GGT GAG GAA TTG GGT 
Gly Glu Glu Uu Gly 
335 



CCC GCA AAT GGC AAT ACT GCG GAG GCT TCA GGC 
Pro Ala Asn Gly Asn Thr Al* Glu Ala Ser Gly 

360 365 



TCG GGC GCT GCG GCA GGC ACG TCG TCT GAA 
a Ser Gly Gly Ala Ala Gly Thx Ser Ser Glu 
375 380 



AAC ACT AAG CTG ACC AC& GTT TTG GAT GCG GTC GAG CTG ACG CAC GGC 
Asn Ser Lys Leu Thr Thr Val Leu Asp Ala Val Glu Leu Thr His Gly 
385 390 395 400 



AAA AAT 
Lys Asn 

405 



GAC AAC TTC AGC AAT GCC GCC CAA CTG 
Asn Phe Sar Asn Ala Ala Gin Lau 

410 415 



1008 



1056 



1104 



1152 



1200 



1248 



GTT GTC GAC GGC ATT ATG ATT 
Val Val Asp Gly lie Met He 
420 



CTG CCT CAA AAT TCA ACA GGC 
Lau Pro Gin Asn Sar Thr Gly 
430 



1296 



AAA AAT AAT CAG CCC GAT CAA GGT AAA AAC GGC GGA ACA GCC TTT ATC 
Lys Asn Asn Gln Pro Asp Gln Gly Lys Asn Gly Gly Thr Ala Phe Ile 
435 440 445 



1344 



TAT AAA ACG ACC TAC ACG CCG AAA AAC 
Tyr Lys Thr Thr Tyr Thr Pro Lys Asn Asp 
450 455 



GAC AAA GAT ACC AAA GCC 
Asp Lys Asp Thr Lys Ala 
460 



1392 



CAA ACA GTC ACG GGC GGC ACG CAA ACC GCT TCA AAT ACG GCA GGT GAT 
Gln Thr Val Thr Gly Gly Thr Gln Thr Ala Ser Asn Thr Ala Gly Asp 
465 470 475 480 



1440 



GCC AAT GGC AAA ACA AAA ACC TAT GAA GTC GAA 6TC TGC TGT TCC AAC 
Ala Asn Gly Lys Thr Lys Thr Tyr Glu Val Glu Val Cys Cys Sar Asn 
485 490 \ 495 



1488 



CTC AAT TAT CTG AAA TAC GGG TTG CTG ACG CGC AAA )^CT GCC GGC AAC 
Leu Asn Tyr Lau Lys Tyr Gly Leu Leu Thr Arg Lys Thr Ala Gly Asn 

500 505 \ 510 



1536 



ACG GTG GGA AGC GGC AAC AGC AGC CCA ACC GCC GCC GCC ^CAA ACG GAC 
Thr Val Gly Ser Gly Asn Ser Ser Pro Thr Ala Ala Ala Gin Thr Asp 
515 520 525 



1584 



GCG CAG ACT ATG TTC CTC CAA GGC GAG CGC ACC GAT GAA AACV AAG ATT 
Ala Gin Ser Met Phe Leu Gin Gly Glu Arg Thr Asp Glu Asn \ys He 
530 535 540 



1632 



CCA AGC GAG CAA AAC GTC GTT TAT CGG GGG TCT TGG TAC GGG CA* ATT 
Pro Ser Glu Gln Asn Val Val Tyr Arg Gly Ser Trp Tyr Gly His Ile 
545 550 555 560 



1680 



GCC AGC AGC ACA AGC TGG AGC GGC AAT GCT TCT GAT AAA GAG GGC GGC 
Ala Ser Ser Thr Ser Trp Ser Gly Asn Ala Ser Asp Lys Glu Gly Gly 
565 570 575 

1728 

AAC AGG GCG GAA TTT ACT GTG AAT TTT GGC GAG AAA AAA ATT ACC GGC 1776 
Asn Arg Ala Glu Phe Thr Val Asn Phe Gly Glu Lys Lys Ile Thr Gly 
580 585 590 

ACG TTA ACC GCT GAA AAC AGG CAG GAG GCA ACC TTT ACC ATT GAT GGT 1824 
Thr Leu Thr Ala Glu Asn Arg Gln Glu Ala Thr Phe Thr Ile Asp Gly 
595 600 605 

AAG ATT GAG GGC AAC GGT TTT TCC GGT ACG GCA AAA ACT GCT GAA TTA 1872 
Lys Ile Glu Gly Asn Gly Phe Ser Gly Thr Ala Lys Thr Ala Glu Leu 
610 615 620 

GGT TTT GAT CTC GAT CAA AAA AAT ACC ACC CGC ACG CCT AAG GCA TAT 1920 
Gly Phe Asp Leu Asp Gln Lys Asn Thr Thr Arg Thr Pro Lys Ala Tyr 
625 630 635 640 

ATC ACA GAT GCC AAG GTA AAG GGC GGT TTT TAC GGG CCC AAA CCC GAA 1968 
Ile Thr Asp Ala Lys Val Lys Gly Gly Phe Tyr Gly Pro Lys Ala Glu 
645 650 655 

GAG TTG GGC GGA TGG TTT GCC TAT TCG GAC GAT AAA CAA ACG AAA AAT 2016 
Glu Leu Gly Gly Trp Phe Ala Tyr Ser Asp Asp Lys Gln Thr Lys Asn 
660 665 670 

GCA ACA GAT GCA TCC GGC AAT GGA AAT TCA GCA AGC ACT GCA ACT GTC 2064 
Ala Thr Asp Ala Ser Gly Asn Gly Asn Ser Ala Ser Ser Ala Thr Val 
675 680 685 

GTA TTC GGT GCG AAA CGC CAA CAG CC^CTC CAA TAAACCAAGC CCGATAC 2114 
Val Phe Gly Ala Lys Arg Gln Gln Pro Val Gln 
690 695 



\ 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 699 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Cys Leu Gly Gly Gly Gly Gly Ser Phe Asp Leu Asp Ser Val Asp Thr 
1 5 10 15 

Glu Ala Pro Arg Pro Ala Pro Lys Tyr Gln Asp Val Ser Ser Glu Lys 
20 25 30 

Pro Gln Ala Gln Lys Asp Gln Gly Gly Tyr Gly Phe Ala Met Arg Leu 
35 40 45 

Lys Arg Arg Asn Trp Tyr Pro Ser Ala Lys Glu Asn Glu Val Lys Leu 
50 55 60 

Asn Glu Ser Asp Trp Glu Thr Thr Gly Leu Pro Ser Asn Pro Lys Asn 
65 70 75 80 

Leu Pro Glu Arg Gln Lys Ser Val Ile Asp Gln Val Glu Thr Asp Gly 
85 90 95 

Asp Ser Asn Asn Ser Asn Ile Tyr Ser Ser Pro Tyr Leu Thr Gln Ser 
100 105 110 

Asn His Gln Asn Gly Asn Thr Gly Asn Gly Val Asn Gln Pro Lys Asn 
115 120 125 

Glu Val Thr Asp Tyr Lys Asn Phe Lys Tyr Val Tyr Ser Gly Trp Phe 
130 135 140 

Tyr Lys His Ala Lys Arg Glu Val Asn Leu Ala Val Glu Pro Lys Ile 
145 150 155 160 

Ala Lys Asn Gly Asp Asp Gly Tyr Ile Phe Tyr His Gly Lys Asp Pro 
165 170 175 

Ser Arg Gln Leu Pro Ala Ser Gly Lys Ile Thr Tyr Lys Gly Val Trp 
180 185 190 

His Phe Ala Thr Asp Thr Lys Arg Gly Gln Lys Phe Arg Glu Ile Ile 
195 200 205 

Gln Pro Ser Lys Asn Gln Gly Asp Arg Tyr Ser Gly Phe Ser Gly Asp 
210 215 220 

Asp Asp Glu Gln Tyr Ser Asn Lys Asn Glu Ser Met Leu Lys Asp Gly 
225 230 235 240 

His Glu Gly Tyr Gly Phe Ala Ser Asn Leu Glu Val Asp Phe Asp Asn 
245 250 255 

Lys Lys Leu Thr Gly Lys Leu Ile Arg Asn Asn Ala Asn Gln Asn Asn 
260 265 270 

Asn Thr Asn Asn Asp Lys His Thr Thr Gln Tyr Tyr Ser Leu Asp Ala 
275 280 285 

Thr Leu Lys Gly Asn Arg Phe Ser Gly Lys Ala Glu Ala Thr Asp Lys 
290 295 300 

Pro Lys Asn Asp Gly Glu Thr Lys Glu His Pro Phe Val Ser Asp Ser 
305 310 315 320 

Ser Ser Leu Ser Gly Gly Phe Phe Gly Pro Gln Gly Glu Glu Leu Gly 
325 330 335 

Phe Arg Phe Leu Ser Asn Asp Gln Lys Val Ala Val Val Gly Ser Ala 
340 345 350 

Lys Thr Lys Asp Lys Pro Ala Asn Gly Asn Thr Ala Glu Ala Ser Gly 
355 360 365 

Gly Thr Asp Ala Ala Ala Ser Gly Gly Ala Ala Gly Thr Ser Ser Glu 
370 375 380 

Asn Ser Lys Leu Thr Thr Val Leu Asp Ala Val Glu Leu Thr His Gly 
385 390 395 400 

Gly Thr Ala Ile Lys Asn Leu Asp Asn Phe Ser Asn Ala Ala Gln Leu 
405 410 415 

Val Val Asp Gly Ile Met Ile Pro Leu Leu Pro Gln Asn Ser Thr Gly 
420 425 430 

Lys Asn Asn Gln Pro Asp Gln Gly Lys Asn Gly Gly Thr Ala Phe Ile 
435 440 445 

Tyr Lys Thr Thr Tyr Thr Pro Lys Asn Asp Asp Lys Asp Thr Lys Ala 
450 455 460 

Gln Thr Val Thr Gly Gly Thr Gln Thr Ala Ser Asn Thr Ala Gly Asp 
465 470 475 480 

Ala Asn Gly Lys Thr Lys Thr Tyr Glu Val Glu Val Cys Cys Ser Asn 
485 490 495 

Leu Asn Tyr Leu Lys Tyr Gly Leu Leu Thr Arg Lys Thr Ala Gly Asn 
500 505 510 

Thr Val Gly Ser Gly Asn Ser Ser Pro Thr Ala Ala Ala Gln Thr Asp 
515 520 525 

Ala Gln Ser Met Phe Leu Gln Gly Glu Arg Thr Asp Glu Asn Lys Ile 
530 535 540 

Pro Ser Glu Gln Asn Val Val Tyr Arg Gly Ser Trp Tyr Gly His Ile 
545 550 555 560 

Ala Ser Ser Thr Ser Trp Ser Gly Asn Ala Ser Asp Lys Glu Gly Gly 
565 570 575 

Asn Arg Ala Glu Phe Thr Val Asn Phe Gly Glu Lys Lys Ile Thr Gly 
580 585 590 

Thr Leu Thr Ala Glu Asn Arg Gln Glu Ala Thr Phe Thr Ile Asp Gly 
595 600 605 

Lys Ile Glu Gly Asn Gly Phe Ser Gly Thr Ala Lys Thr Ala Glu Leu 
610 615 620 

Gly Phe Asp Leu Asp Gln Lys Asn Thr Thr Arg Thr Pro Lys Ala Tyr 
625 630 635 640 

Ile Thr Asp Ala Lys Val Lys Gly Gly Phe Tyr Gly Pro Lys Ala Glu 
645 650 655 

Glu Leu Gly Gly Trp Phe Ala Tyr Ser Asp Asp Lys Gln Thr Lys Asn 
660 665 670 

Ala Thr Asp Ala Ser Gly Asn Gly Asn Ser Ala Ser Ser Ala Thr Val 
675 680 685 

Val Phe Gly Ala Lys Arg Gln Gln Pro Val Gln 
690 695 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 198 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: N. meningitidis 

(B) STRAIN: IM2169 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 



Thr Lys Asp Lys Leu Glu Asn Gly Ala Ala Ala Ser Gly Ser Thr Gly 
1 5 10 15 

Ala Ala Ala Ser Gly Gly Ala Ala Gly Thr Ser Ser Glu Asn Ser Lys 
20 25 30 

Leu Thr Thr Val Leu Asp Ala Val Glu Leu Thr Leu Asn Asp Lys Lys 
35 40 45 

Ile Lys Asn Leu Asp Asn Phe Ser Asn Ala Ala Gln Leu Val Val Asp 
50 55 60 

Gly Ile Met Ile Pro Leu Leu Pro Lys Asp Ser Glu Ser Gly Asn Thr 
65 70 75 80 

Gln Ala Asp Lys Gly Lys Asn Gly Gly Thr Glu Phe Thr Arg Lys Phe 
85 90 95 

Glu His Thr Pro Glu Ser Asp Lys Lys Asp Ala Gln Ala Gly Thr Gln 
100 105 110 

Thr Asn Gly Ala Gln Thr Ala Ser Asn Thr Ala Gly Asp Thr Asn Gly 
115 120 125 

Lys Thr Lys Thr Tyr Glu Val Glu Val Cys Cys Ser Asn Leu Asn Tyr 
130 135 140 

Leu Lys Tyr Gly Met Leu Thr Arg Lys Asn Ser Lys Ser Ala Met Gln 
145 150 155 160 

Ala Gly Gly Asn Ser Ser Gln Ala Asp Ala Lys Thr Glu Gln Val Glu 
165 170 175 

Gln Ser Met Phe Leu Gln Gly Glu Arg Thr Asp Glu Lys Glu Ile Pro 
180 185 190 

Thr Asp Gln Asn Val Val 
195 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 198 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: N. meningitidis 

(B) STRAIN: 6940 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Thr Lys Asp Lys Thr Glu Asn Gly Ala Val Ala Ser Gly Gly Thr Asp 
1 5 10 15 

Ala Ala Ala Ser Asn Gly Ala Ala Gly Thr Ser Ser Glu Asn Ser Lys 
20 25 30 

Leu Thr Thr Val Leu Asp Ala Val Glu Leu Lys Leu Gly Asp Lys Glu 
35 40 45 

Val Gln Lys Leu Asp Asn Phe Ser Asn Ala Ala Gln Leu Val Val Asp 
50 55 60 

Gly Ile Met Ile Pro Leu Leu Pro Glu Ala Ser Glu Ser Gly Asn Asn 
65 70 75 80 

Gln Ala Asn Gln Gly Thr Asn Gly Gly Thr Ala Phe Thr Arg Lys Phe 
85 90 95 

Asp His Thr Pro Glu Ser Asp Lys Lys Asp Ala Gln Ala Gly Thr Gln 
100 105 110 

Thr Asn Gly Ala Gln Thr Ala Ser Asn Thr Ala Gly Asp Thr Asn Gly 
115 120 125 

Lys Thr Lys Thr Tyr Glu Val Glu Val Cys Cys Ser Asn Leu Asn Tyr 
130 135 140 

Leu Lys Tyr Gly Met Leu Thr Arg Lys Asn Ser Lys Ser Ala Met Gln 
145 150 155 160 

Ala Gly Glu Ser Ser Ser Gln Ala Asp Ala Lys Thr Glu Gln Val Glu 
165 170 175 

Gln Ser Met Phe Leu Gln Gly Glu Arg Thr Asp Glu Lys Glu Ile Pro 
180 185 190 

Ser Glu Gln Asn Ile Val 
195 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 198 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: N. meningitidis 

(B) STRAIN: 2223 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 



Thr Lys Asp Lys Thr Glu Asn Gly Ala Val Ala Ser Gly 
15 10 




Thr Asp 
15 

Ala Ala Ala Ser Asn Gly Ala Ala Gly Thr Ser Ser Glu Asn Ser Lys 
20 25 30 

Leu Thr Thr Val Leu Asp Ala Val Glu Leu Lys Leu Gly Asp Lys Glu 
35 40 45 

Val Gln Lys Leu Asp Asn Phe Ser Asn Ala Ala Gln Leu Val Val Asp 
50 55 60 

Gly Ile Met Ile Pro Leu Leu Pro Glu Ala Ser Glu Ser Gly Asn Asn 
65 70 75 80 

Gln Ala Asn Gln Gly Thr Asn Gly Gly Thr Ala Phe Thr Arg Lys Phe 
85 90 95 

Asp His Thr Pro Glu Ser Asp Lys Lys Asp Ala Gln Ala Gly Thr Gln 
100 105 110 

Ala Asn Gly Ala Gln Thr Ala Ser Asn Thr Ala Gly Asp Thr Asn Gly 
115 120 125 

Lys Thr Lys Thr Tyr Glu Val Glu Val Cys Cys Ser Asn Leu Asn Tyr 
130 135 140 

Leu Lys Tyr Gly Met Leu Thr Arg Lys Asn Ser Lys Ser Ala Met Gln 
145 150 155 160 

Ala Gly Glu Ser Ser Ser Gln Ala Asp Ala Lys Thr Glu Gln Val Gly 
165 170 175 

Gln Ser Met Phe Leu Gln Gly Glu Arg Thr Asp Glu Lys Glu Ile Pro 
180 185 190 

Ser Glu Gln Asn Ile Val 
195 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 198 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: N. meningitidis 

(B) STRAIN: C708 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 



Thr Gln Asp Lys Pro Arg Asn Gly Ala Val Ala Ser Gly Gly Thr Gly 
1 5 10 15 

Ala Ala Arg Ser Asn Gly Ala Ala Gly Gln Ser Ser Glu Asn Ser Lys 
20 25 30 

Leu Thr Thr Val Leu Asp Ala Val Glu Leu Thr Leu Asp Lys Lys 

35 40 



Ile Lys Asn Leu Asp Asn Phe Ser Asn Ala Ala Gln Leu Val Val Asp 
50 55 60 

Gly Ile Met Ile Pro Leu Leu Pro Glu Ala Ser Glu Ser Gly Lys Asn 
65 70 75 80 

Gln Ala Asn Gln Gly Thr Asn Gly Gly Thr Ala Phe Thr Arg Lys Phe 
85 90 95 

Asn His Thr Pro Lys Ser Asp Glu Lys Asp Thr Gln Ala Gly Thr Ala 
100 105 110 

Glu Asn Gly Asn Pro Ala Ala Ser Asn Thr Ala Gly Asp Ala Asn Gly 
115 120 125 

Lys Thr Lys Thr Tyr Glu Val Glu Val Cys Cys Ser Asn Leu Asn Tyr 
130 135 140 

Leu Lys Tyr Gly Met Leu Thr Arg Lys Asn Ser Lys Ser Ala Met Gln 
145 150 155 160 

Ala Gly Glu Ser Ser Ser Gln Ala Asp Ala Lys Thr Glu Gln Val Gly 
165 170 175 

Gln Ser Met Phe Leu Gln Gly Glu Arg Thr Asp Glu Lys Glu Ile Pro 
180 185 190 

Asn Asp Gln Asn Val Val 
195 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 211 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: N. meningitidis 

(B) STRAIN: M978 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Thr Gln Asp Lys Ala Ala Asn Gly Asn Thr Ala Ala Ala Ser Gly Gly
1 5 10 15

Thr Asp Ala Ala Ala Ser Asn Gly Ala Ala Gly Thr Ser Ser Glu Asn
20 25 30

Ser Lys Leu Thr Thr Val Leu Asp Ala Val Glu Leu Thr Leu Asn Asp
35 40 45

Lys Lys Ile Lys Asn Leu Asp Asn Phe Ser Asn Ala Ala Gln Leu Val
50 55 60

Val Asp Gly Ile Met Ile Pro Leu Leu Pro Glu Thr Ser Glu Ser Gly
65 70 75 80

Ser Asn Gln Ala Asp Lys Gly Lys Lys Gly Lys Asn Gly Lys Asn Gly
85 90 95

Gly Thr Asp Phe Thr Tyr Lys Thr Thr Tyr Thr Pro Lys Asn Asp Asp
100 105 110

Lys Asp Thr Lys Ala Gln Thr Gly Ala Ala Gly Ser Ser Gly Ala Gln
115 120 125

Thr Asp Leu Gly Lys Ala Asp Val Asn Gly Gly Lys Ala Glu Thr Lys
130 135 140

Thr Tyr Glu Val Glu Val Cys Cys Ser Asn Leu Asn Tyr Leu Lys Tyr
145 150 155 160

Gly Met Leu Thr Arg Lys Asn Ser Lys Ser Ala Met Gln Ala Gly Gly
165 170 175



Asn Ser Ser Gln Ala Asp Ala Lys Thr Glu Gln Val Glu Gln Ser Met
180 185 190

Phe Leu Gln Gly Glu Arg Thr Asp Glu Lys Glu Ile Pro Asn Asp Gln
195 200 205

Asn Val Val
210

(2) INFORMATION FOR SEQ ID NO: 16:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 200 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(vi) ORIGINAL SOURCE:

(A) ORGANISM: N. meningitidis

(B) STRAIN: 1610

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:



Lys Arg Asp Lys Ala Glu Ser Gly Gly Gly Asn Gly Ala Ser Gly Gly
1 5 10 15

Thr Asp Ala Ala Ala Ser Asn Gly Ala Ala Gly Thr Ser Ser Glu Asn
20 25 30

Ser Lys Leu Thr Thr Val Leu Asp Ala Val Glu Leu Lys Ser Gly Gly
35 40 45

Lys Glu Val Lys Asn Leu Asp Asn Phe Ser Asn Ala Ala Gln Leu Val
50 55 60

Val Asp Gly Ile Met Ile Pro Leu Leu Pro Lys As & Ser Glu Ser Gly
65 70 75 80

Asn Thr Gln Ala Asp Lys Gly Lys Asn Gly Gly Thr Phe Thr Arg
85 90 95

Lys Phe Glu His Thr Pro Glu Ser Asp Lys Lys Asp Ala Gln Ala Gly
100 105 110

Thr Gln Thr Asn Gly Ala Gln Thr Ala Ser Asn Thr Ala Gly Asp Thr
115 120 125

Asn Gly Lys Thr Lys Thr Tyr Glu Val Glu Val Cys Cys Ser Asn Leu
130 135 140

Asn Tyr Leu Lys Tyr Gly Leu Leu Thr Arg Lys Thr Ala Gly Asn Thr
145 150 155 160

Gly Glu Gly Gly Asn Gly Ser Gln Thr Ala Ala Ala Gln
165 170

Gly Ala Gln Ser Met Phe Leu Gln Gly Glu Arg Thr Asp Glu Lys Glu
180 185 190

Ile Pro Ser Glu Gln Asn Val Val
195 200


(2) INFORMATION FOR SEQ ID NO: 17:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 200 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(vi) ORIGINAL SOURCE:

(A) ORGANISM: N. meningitidis

(B) STRAIN: 867

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:

Thr Lys Asp Lys Pro Arg Asn Gly Ala Val Ala Ser Gly Gly Thr Asp
1 5 10 15

Ala Ala Ala Ser Asn Gly Ala Ala Gly Thr Ser Ser Glu Asn Gly Lys
20 25 30

Leu Thr Thr Val Leu Asp Ala Val Glu Leu Thr Leu Asn Asp Lys Lys
35 40 45

Ile Lys Asn Leu Asp Asn Phe Ser Asn Ala Ala Gln Leu Val Val Ser
50 55 60

Gly Ile Met Ile Pro Leu Met Pro Glu Thr Ser Glu Ser Gly Asn Asn
65 70 75 80

Gln Ala Asp Lys Gly Lys Asn Gly Gly Thr Ala Phe Thr Arg Lys Phe
85 90 95

Asp His Thr Pro Lys Ser Asp Glu Lys Asp Thr Gln Ala Gly Thr Pro
100 105 110

Thr Asn Gly Ala Gln Thr Ala Ser Gly W Ala Gly Val Thr Gly Gly
115 120 125

Gln Ala Gly Lys Thr Tyr Ala Val Glu Val Cys Cys Ser Asn Leu Asn
130 135 140

Tyr Leu Lys Tyr Gly Leu Leu Thr Arg Lys Ar Ala Asp Asn Thr Val
145 150 155 160

Gly Ser Gly Asn Gly Ser Ser Thr Ala Ala Ala Gln Thr Ala Gln Gly
165 170 175

Ala Gln Ser Met Phe Leu Gln Gly Glu Arg Thr Asp Glu Lys Glu Ile
180 185 190

Pro Lys Glu Gln Gln Asp Ile Val
195 200



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 198 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(vi) ORIGINAL SOURCE:

(A) ORGANISM: N. meningitidis

(B) STRAIN: S3032

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:



Thr Lys Asp Lys Pro Ala Asn Gly Asn Thr Ala Glu Ala Ser Gly Gly
1 5 10 15

Thr Asp Ala Ala Ala Ser Gly Gly Ala Ala Gly Thr Ser Ser Glu Asn
20 25 30

Ser Lys Leu Thr Thr Val Leu Asp Ala Val Glu Leu Thr His Gly Gly
35 40 45

Thr Ala Ile Lys Asn Leu Asp Asn Phe Ser Asn Ala Ala Gln Leu Val
50 55 60

Val Asp Gly Ile Met Ile Pro Leu Leu Pro Gln Asn Ser Thr Gly Lys
65 70 75 80

Asn Asn Gln Pro Asp Gln Gly Lys Asn Gly Gly Thr Ala Phe Ile Tyr
85 90 95

Lys Thr Thr Tyr Thr Pro Lys Asn Asp Asp Lys Asp Thr Lys Ala Gln
100 105 110

Thr Val Thr Gly Gly Thr Gln Thr Ala Ser Asn Thr Ala Gly Asp Ala
115 120 125

Asn Gly Lys Thr Lys Thr Tyr Val Glu Val Cys Cys Ser Asn Leu
130 135 140

Asn Tyr Leu Lys Tyr Gly Leu Leu Thr Arg Lys Thr Ala Gly Asn Thr
145 150 155 160

Val Gly Ser Gly Asn Ser Ser Pro Thr Ala Ala Ala Gln Thr Asp Ala
165 170 175

Gln Ser Met Phe Leu Gln Gly Glu Arg Thr Asp Glu Asn Lys Ile Pro
180 185 190

Ser Glu Gln Asn Val Val
195



(2) INFORMATION FOR SEQ ID NO: 19:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 195 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: N. meningitidis 

(B) STRAIN: 891 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Thr Lys Asp Lys Pro Gly Asn Gly Ala Arg Leu Gln Ala
1 5 10

Gly Thr Ser Asn Gly Ala Ala Gly Gln Ser Ser Glu Asn Ser Lys Leu
20 25 30

Thr Thr Val Leu Asp Ala Val Glu Leu Lys Leu Gly Asp Lys Glu Val
35 40 45

Gln Lys Leu Asp Asn Phe Ser Asn Ala Ala Gln Leu Val Val Asp Gly
50 55 60

Ile Met Ile Pro Leu Leu Pro Lys Asp Ser Glu Ser Gly Lys Asn Gln
65 70 75 80

Ala Asp Lys Gly Lys Asn Gly Glu Thr Glu Phe Thr Arg Lys Phe Glu
85 90 95

His Thr Pro Glu Ser Asp Glu Lys Asp Ala Gln Ala Gly Thr Pro Ser
100 105 110

Asn Gly Ala Gln Thr Ala Ser Asn Thr Ala Gly Asp Thr Asn Gly Lys
115 120 125

Thr Lys Thr Tyr Glu Val Asn Leu Cys Ser Asn Leu Asn Tyr Leu Lys
130 135 140

Tyr Gly Leu Leu Thr Arg Lys Thr Ala Gly Asn Thr Gly Glu Gly Gly
145 150 155 160

Asn Ser Ser Pro Thr Ala Ala Gln Thr Ala Gln Gly Ala Gln Ser Met
165 170 175

Phe Leu Gln Gly Glu Arg Thr Asp Glu Lys Glu Ile Pro Asn Asp Gln
180 185 190

Asn Val Val
195



(2) INFORMATION FOR SEQ ID NO: 20:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 29 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:

AAACCCGGAT CCGTTGCCAG CGCTGCCGT 29



(2) INFORMATION FOR SEQ ID NO: 21:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 85 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:

ttttttcaSfg agataxctgg caacattgtt GTTATCTCTG GCGGTGTTAA TCACCGCCGG 60

GTGCCTGGG* GGCGGCGGCA GTTTC 85



(2) INFORMATION FOR SEQ ID NO: 22:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22:

GTGTTTTTGT TGAGTGCATG CCTGGGTGGC 30



(2) INFORMATION FOR SEQ ID NO: 23:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 40 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23:

TGCGCAAGCT TACAGTTTGT CTTTG CGCGCTGCCG 40



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 40 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24:

AAAAAGCATG CATAAAAACT ACGCGTTACA CCATTCAAG 40



(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 39 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25:

TATATAAGCT TACGTTGCAG GCCCTGCCGC GTTTTCCCC 39



(2) INFORMATION FOR SEQ ID NO: 26:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 29 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26:

CCCGAATTCT GCCGTCTGAA GCCTTATTC 29



(2) INFORMATION FOR SEQ ID NO: 27:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 28 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27:

CCCGAATTCT GCTATGGTGC TGCCT 28

(2) INFORMATION FOR SEQ ID NO: 28:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28:

CGCATCCAAA ACCGTACCTG TGCTGCCTGA 30



(2) INFORMATION FOR SEQ ID NO: 29:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29:

TTTATCACTT TCCGGGGGCA GGAGCGGAAT 30



(2) INFORMATION FOR SEQ ID NO: 30:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30:

GTTGGAACAG CiWS(ACAGCGG TTTGCGCCCC

(2) INFORMATION FOR SEQ ID NO: 31:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31:

GAACATACTT TGTTCGTTTT TGO 30



(2) INFORMATION FOR SEQ ID NO: 32:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(vi) ORIGINAL SOURCE:

(A) ORGANISM: N. meningitidis

(B) STRAIN: IM2394

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32:

Tyr Lys Gly Thr Trp
1 5

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: N. meningitidis 

(B) STRAIN: IM2394

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33:

Glu Phe Glu Val Asp Phe Ser Asp Lys Thr Ile Lys Gly Thr Leu
1 5 10 15

(2) INFORMATION FOR SEQ ID NO: 34:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 12 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(vi) ORIGINAL SOURCE:

(A) ORGANISM: N. meningitidis

(B) STRAIN: IM2394

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34:

Glu Gly Gly Phe Tyr Gly Pro Lys Gly Glu Glu Leu
1 5 10

(2) INFORMATION FOR SEQ ID NO: 35:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(vi) ORIGINAL SOURCE:

(A) ORGANISM: N. meningitidis

(B) STRAIN: IM2394

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35:

Ala Val Phe Gly Ala Lys
1 5

(2) INFORMATION FOR SEQ ID NO: 36:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2070 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Neisseria meningitidis

(B) STRAIN: BZ83

(ix) FEATURE:

(A) NAME/KEY: sig_peptide

(B) LOCATION: 1..60

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 61..2067

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..2067

ATGAACAATCCATTGGTAAATCAGGCTGCTATGGTGCTGCCTGTGTTTTTGTTGAGTGCT
60
TACTTGTTAGGTAACCATTTAGTCCGACGATACCACGACGGACACAAAAACAACTCACGA
MetAsnAsnProLeuValAsnGlnAlaAlaMetValLeuProValPheLeuLeuSerAla

TGTCTGGGCGGAGGCGGCAGTTTCGATCTTGATTCTGTCGATACCGAAGCCCCGCGTCCC
120
ACAGACCCGCCTCCGCCGTCAAAGCTAGAACTAAGACAGCTATGGCTTCGGGGCGCAGGG
CysLeuGlyGlyGlyGlySerPheAspLeuAspSerValAspThrGluAlaProArgPro

GCGCCAAAGTATCAAGATGTTTCTTCCGAAACACCGCAAGCCCAAAAAGACCAAGGCGGA
180
CGCGGTTTCATAGTTCTACAAAGAAGGCTTTGTGGCGTTCGGGTTTTTCTGGTTCCGCCT
AlaProLysTyrGlnAspValSerSerGluThrProGlnAlaGlnLysAspGlnGlyGly

TACGGTTTTGCAATGCGCTTCAAGCGGCGGAATTGGTACCCAAAAAATGAAGAAGATCAT
240
ATGCCAAAACGTTACGCGAAGTTCGCCGCCTTAACCATGGGTTTTTTACTTCTTCTAGTA
TyrGlyPheAlaMetArgPheLysArgArgAsnTrpTyrProLysAsnGluGluAspHis

AAGGCATTATCAGAAGCGGATTGGGAGAAGTTAGGTGCGGGTAAGCCAGATGAGTTTCCC
300
TTCCGTAATAGTCTTCGCCTAACCCTCTTCAATCCACGCCCATTCGGTCTACTCAAAGGG
LysAlaLeuSerGluAlaAspTrpGluLysLeuGlyAlaGlyLysProAspGluPhePro

CAAAGGAATGAAATATTGAATATGACTGACGGAATTCTGAGTGAGTCTCTTCAGCTGGGT
360
GTTTCCTTACTTTATAACTTATACTGACTGCCTTAAGACTCACTCAGAGAAGTCGACCCA
GlnArgAsnGluIleLeuAsnMetThrAspGlyIleLeuSerGluSerLeuGlnLeuGly
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GAGGG CGG CAAAAGC CG CG7AG AAGGATACACGGATTTCCAATATGTCCGCTCGGGCTAT 

„ . * . . * . . + 420 

fcCCG CCGTTTTCGGCG CATCTTCCTATGTGCCTAAAGGTTATACAGGCGAGCCCGATA 

Glu^yGlyLysSerArgValGluGlyTyrTKrAspPheGlr.T\TrValArgSerGlyT^ 



ATCTACCG CAACG 3TG C C AATAAAATCGA . . ; CCAAAAAAAAATC3CCCTTTCCGGTCCG 
. . - . . ♦ 480 

TAGATGGCG7TGCCACGGTTAT'- " - AGCTAAAGUi - I i i I a i aAGCGGGAAAGGCCAGGC 

\ 

IleTvrArgAsr.GlvAiaAsrXvsIleAsoPheGlni.vsLysIleAlaLeuSerGlyPro 

\ 



\ 

GACGGCTACCTTTTCTACAAAGGCAGCAATCCTTCCCAAGCTCTGCCGATGGGTAAGGTA 

. . . - . * 540 

CTGCCG ATGG AAAAG ATGTTT C CGTCGTTA 3 G AAGGGTTCGAGACGGCTACCCATTCCAT 

AspGlyTyri-euPheTyrLysGlySerAsnProSerGlnAlaLcuProMctGlyLysVal 



GGTTATAAAGGTACTTGGGATTAljra^CC^ 

. * . + UyC + . * . * . ♦ 600 

CCAATATTTCCATGAACCCTAATOCATTGG CTACGGTTCTACCCTGTTTTTAAAAGGGTC 

GlyTyrLysGl>^hrTrpAspTyrValThrAspAlaLysMetGlyGlnLysPhcSert51n 



TTGGCTGGTTTTC CAGCGGGGGATAGGTATGGGGCl'I I G T C TGCCGAGGAAGCGGATCTG 

^ * . . * . . ♦ 660 

AACCGACCAAAAGGTCGCCCCCTATCCATACCCCGAAACAGACGGCTCCTTCCCCTACAC 

\ 

LeuAlaGlyPheProAaaGlyAspArgTyrGlyAlaLeuSerAlaGluCluAlaAspVal 

\ 

\ 

» + . v . - ♦ 



TTGCGCAACAAAAGCGAGGCACAGCAAGGTCAGACCGATTTCGGGCTGACCAGCGAGTTT 

^ + ^ . .... + . * 720 

AACGC G I T G 7 T l I CGCTCCGTGTCG7TCCAGTCTGGCTAAAGCCCGACTGGTCG CTCAAA 

LeuArgAsnLysSerGluAlaGlnGlnGlyGlnThrAspPheGlyLcuThrSerGluPhe 



. * . * . . - \ 

GAGGTGGATTTCGCCGCCAAGACCATGACCGGCGCCCTCTACCGCAATAACCGGATTACT
780
CTCCACCTAAAGCGGCGGTTCTGGTACTGGCCGCGGGAGATGGCGTTATTGGCCTAATGA
GluValAspPheAlaAlaLysThrMetThrGlyAlaLeuTyrArgAsnAsnArgIleThr
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»TAACGAAACCGAAAATAAAGCCAAACAAA7TAAACGTT^^ 

" Y ' * * * • ♦ 840 

TTATTGCTTTGG CTT77A7TTCGGTTTG7T7AATTTGCAATG CTGTAACTCCGACTGGAC 

AsnA^nG 1 uThrG 1 uXsnLy s Al aLys Glnll cLys ArgTyr Aapl leGinAlaAsptrCu 

GACGGTAAO£G CTTCAGCGG CAAGGGAACGGCAACCG ACAAACCCAAAAACGACGAAACC 

♦ + • . + 900 

GTGCCATTGGOgAAGTCGCCGTTCCGTTGC CGT rC G C VU i ' - T G Gl» 1 : * 1"1 G wIGLl 1 1G G 

HisGlyAsxiArgfthcSerGlyLysAlaThrAlaThrAspLysProLysAsnAspGluThr 

AAGGAACATCCCTTTGTTXCCGACTC G T C TT C1 r " GAGCGGCGG C 7 TT" T CGGTCCGAAG 

. - . «. . ^ 960 

7TCC 1 1 G'l AGGGAAACAAAG^CTGAGCAGAAGAAACTCSCCGCCGAAAAAGCCAGGCTTC 

LysGluHisProPheValSerA^pSerSerSerLeuSerGl>-GlyPhePheGlyProLys 

GGTGAGCAATTCG O I 1 I CCC C ITI 1 IC agCGACGATCAAAAA G TT G CC UT ' lG ' i CGGCAGC 

-+ 1020 

CCACTCCTTAACCCAAAGGCGAAAAACT C 6CT G CTA <j ' l " l " i " l " rC AACGGCAACAGCCGTCG 
GlyGluGluLeuGlyPheArgPheLeuScrAdpAspGlnLysValAlaValValGXySer 

GCGAAAACCAAAGACAAACTGGAAAATGGCGCGGCG<j^TTCAGGCAGCACAGGTGCGGCA 

1080 

cc i i i ti g gt 1 1 c it, 1 1 ic ac c : t i t a ccgcgccgccgAgtccgtcgtctccacgccgt 

AlaLysThrLysAspLysLeuGluAsnGlyAlaAlaAlaSe^GlySerTtirGlyAlaAla 

-4- 

GCATCGGGCGGTGCGGCAGATATGCCGTCTGAAAACGGTAAGCTG^CACGGTTTTGGAT 

-+ 1140 

CGTAGCCCGCCACGCCGTCTATACGGCAGA C T " X 1 1 CCCATTCCACTCdg' C CCAAAACCTA 
AlaSerGlyGlyAlaAlaAspMecProSerGluAsnGlyLysLeuThrThrValLeuAsp 

GCGGTTGAGCTGAAATCTGGCGGTAAGGAAGTCAAAAATCTCGACAACTTCAGQAATGCC 
. + . «. ^ . + \ * 12 00 

CG CCAACTCG AC TT T AG ACCG C C ATTCCTTC A G i I ' L r ' l 'AGAGCT G TT G AAGTCGTTACGG 
AlaValGluLeuLysSerGl'i.'GiyLysGluValLysAsnl.euAspAsnPheSerAsVAla 



GCCCAACTGGTTGTCGACGGCATTATGATTCCGCTCCTGCCCAAGAATTCCGAAAGCGAG
1260
CGGGTTGACCAACAGCTGCCGTAATACTAAGGCGAGGACGGGTTCTTAAGGCTTTCGCTC
AlaGlnLeuValValAspGlyIleMetIleProLeuLeuProLysAsnSerGluSerGlu




AGCAATSAGG C AG ATAAAGG7AAAAACGG CCG AAC AGCCTTT ACCCG CAAATTTGAACAC 

* * . * • + 1320 

rCGTTAGTGCGTCTATTTCCAr T " r i GCCGC Cl I UX CGGAAATGGS CG T TT AAA CTlt»rG 

SerAsnGlnA^aAspLyBGlyLysAsnGlyGlyThrAla?heThrArgI*ysPhcGluHis 



ACGCCGGAAAGTGATAAAAAAGACACCCAAGCAGGTACGGCGGAGAATGGCAATCCAGCC 

'llZZ • " " • * • + "80 

TGCGGCCT 1 A CACTA'i IVi'i'l CTGTGGG7TCGTCCATGCCGCCTCTTACCGTTAGGTCGG 

ThrProGluSerAspLysl-ysAspThrGlxiAlaGlyThrAlaGluAsnGlyAsnProAla 

------- . . . + 

\ 

gcttcaaatacgccaggtgatAccaatcgcaaaaca^ 

* # + + . ♦ . * . ♦ 144 0 

cgaagtttatgccgtccactatggttagcgttttgtttttggatacttcagcttcagacg 

AlaS e r AsnThr Al a G 1 yAspThr AsnG^y^LysThr Ly sThrTy rGl uVa 1 Gl u Val Cys 

_ _ \ 



TGTTCCAACCTCAATTATCTGAAATACGGAATGTTGACGCGTAAAAACAGCAAGTCCGCG 

. * o . ♦ . ♦ isoo 

ACAAGGTTGGAGTTAATAGACTTTATGCCTTACAACTGCGCATTTTTGTCGTTCAGGCGC 
CysSerAsnLeuAsnTyrLeuLysTyrGlyMetLeuThrArgLysAsnSerliysSerAla 



ATGCAGGCAGGCG AAAACGGTAGTCTACCTGACGCTAAAACGGAACAA G 1 1GAACAAAGT 

. * . * . ♦ . * . . ♦ 1560 

TACGTCCCTCCC Ci. 1 V1 GCCATCAGATCGACTGCC A1 1 i. IG C L: I b H 'CAA L 1 1U A i ' A CA 

\ 

MecGlnAlaGlyGluAsnGlySerLeuAlaA5cAlaLvsThrGluGlnValGluGlnSer 



\ 



\ 



ATGTTCCTCCAAGGCG AG CG CACCG ATG AAAAAGAGATTCGAAAAGAGCAACAAGACATC 
. . ^ ^ . * \ + 1620 

TACAAGGAGGTTCCGCTCGCGTGGCTA C IT I ! rC "CTAAG G 77T T'CTCGT T G VA'd' G TAG 
NfitPheLeuGlnGlyGluArcThrAspGluLysGlulleProLysGluGlnGlnAsplle 



GTTTACCGGGGGTCTTGGTACGGGCATATTGCCAACGACACAAGCTGGAGCGGCAATGCT
1680
CAAATGGCCCCCAGAACCATGCCCGTATAACGGTTGCTGTGTTCGACCTCGCCGTTACGA
ValTyrArgGlySerTrpTyrGlyHisIleAlaAsnAspThrSerTrpSerGlyAsnAla
TCAGATAGAGAQpGCGGCAACAGGGCGGACTTTACCGTSA Al 1 1 TGGTA CGAAAAAAATT 

1740 

AGTCTATCTCTCCflff CCGTTGTCCCGCCTGAAATGGCACTTAAAACC A T GC I X I 1 1TXA A 
SerAapAxgGluGly^yAsnArgAlaAspPheThrValAsnPheGlyThrLycLyfille 

AACGCSAACGTTAACCGCTGAA^CAGGCAGGAGGCAACCTTTACCATTGTGGGCGATATT 

_ 1800 

TTCCCTTO^TTGCCGACTTTTOTCCGTC 

AsnGlyThrLeuThrAlaGluAsnAf^GlnGluAlaThrPheThrlXevalGlyAspZle 

AAGGACAACGGCTTTGAAGGTACGGCGAAAAOgtSCTGACTCA GGi - l 1 1U ATCTCCATCAA 

- + 1660 

TTCCTGTTCCCGAAACTTCCA7CCCG I I 1 n C A&PACTGAGTCCAAAACTAGACCTAGTT 
LysAspAsnGlyPheGluGl^-ThrAlaLysThrAlXAspSerGlyPheAspLeuAspGln 

-V * - -- - . > 
AGCAATACCA CCCG CACG CCTAAGGCATATATCACAGATGCpAAGGTGAAGGGCGGTTTT 

1920 

TCGTTATGGTGGGCGTGCGGATTCCGTATATAGTGTCTACGG^TCCACTTCCCGCCAAAA 
Ser AsnThrThrArgThrProLysAlaTyx I leThrA^pAlal^ykValLycClyClyPhc 

TACGGGCCTAAAGCCGAAGAGTTGGGCGGATGGTTTGCCTATCCGGGCGATAAACAAACG
1980
ATGCCCGGATTTCGGCTTCTCAACCCGCCTACCAAACGGATAGGCCCGCTATTTGTTTGC
TyrGlyProLysAlaGluGluLeuGlyGlyTrpPheAlaTyrProGlyAspLysGlnThr

GAAAAGGCAACGGTTACATCCGGCGATGGAAATTCAGCAAGCAGTGCAACTGTCGTATTC
2040
CTTTTCCGTTGCCAATGTAGGCCGCTACCTTTAAGTCGTTCGTCACGTTGACAGCATAAG
GluLysAlaThrValThrSerGlyAspGlyAsnSerAlaSerSerAlaThrValValPhe

GGTGCGAAACGCCAAAAGCCTGTGCAATAA
2070
CCACGCTTTGCGGTTTTCGGACACGTTATT
GlyAlaLysArgGlnLysProValGlnTer



(2) INFORMATION FOR SEQ ID NO: 37:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 669 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37:

(2) INFORMATION FOR SEQ ID NO: 38:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2136 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Neisseria meningitidis

(B) STRAIN: BZ163

(ix) FEATURE:

(A) NAME/KEY: sig_peptide

(B) LOCATION: 1..60

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 61..2133

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..2133
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ATGAAl 
TACTTl 



JUlQ^TCCATTGG7AAATCAGG~GCTATGGTG~GC~~GTTTrTGTTCAC 
TTG^AGGTAACCATTTAGTCCGACGATACCACGACGGAC^C^AAAACA^CTC 



TGAGTGCT 
CTCACGA 

MetAsnAskProi*euValAanGacAlaAla«etValLeuProValPheLeuLr«uSerAla 



60 



a u a i iCGGCGGA^GCGGCAGTTTCGA iw*. GATTCTGTCGA7ACCGAAGCCCCGCGTCCC 
ACAAACCCGCCreiGCCGTCAAAGCTAGAACTAAG^ 

CysI^uGlyGlyGlyciySerPheAspLeuAspSerValAspThrGXuAlaProArgPro 



120 



GCGCCAAAATATCAAGA* i,\ • i. . : CCGAAAAACCGCAAGCCCAAAAAGACCAAGGCGGA 



CGCGGTTTTATAGTTCTAC 
AlaProLysTyrGInAspValSe* 



SCI l'l 1TCGCGTTCGG G* ill rCiWrr CCCCCT 
|TGluL>rsProGlaAlaGlfiLyaAspGlnGlyGly 



180 



VTCGGCATCCGCAGGCAAAAGAAGACAAA 



TACGGT1 1 1 laCG ATG AGGTTG AAACGGJ3 

ATGCCAAAACGCTACTCCAACTTTCCCTCCTTAG CCGTAGGCGTCCGTTTTC 
TyrtlyPhcAlaMetArgLauLysArgArgAsktegHisProGIxxAlaLysGluAspLys 



240 



GTTGAACTAAACCCAAATGATTGGGAGGAGACAGGATTGCCGAGCAAGCCCCAAAACTTA 
CAACTTGATTTGGGTTTACTAACCC7CCTC7CT 

ValGlul^uAsnProAsiiAspTrpGluGluThrClyl^u^ 



300 



CCCGAGCGACAGCAATCGGTTATTGATAAAGTAAAAACAGACGATGG^GCAATATTTAC 

GGGCTCGCTGTCGTrAGCCAATAACTATTTCATT^GTCTGCTA 

ProGluAi^GlnGlnSerVallleAspLysValLysThrAspAspGlyS^AanlleTyr 



360 
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, f 

■ — — * -i 2 0 
3CCACGTTTG 



GlvAlaAsn 



TGAaGGGG AATAG AST G CGTTAGTTTGGTAGTTTTGCw GTCGT3ATTATCGCC? 
ThrSerProT-vrLeuThrGInSerAsnHisGlnAsnGlvSerThrAsaSerGl\ 

\ 

;;• ;x;:;;.;:;.:;::z;;:z:'" 

CAACCAAA^ArvCGAAG ^. ruw\Gn - .aJvwiAaa - * J«v\*a*wi - aATTCCGGCTGGTTT 

. -3_- . ^ . - . * 480 

GTTGUl 1 1 : . .v GCTTCATTTTCTAATG A i : * a ' A AAGTTTATACAAATAAGGCCGACCAAA 

\ 

GlnProLysAsr.GluValLysAspTv^LysAsRPheLysTyrValTy^SerGlyTrpPh^ 



\ 

TATAAACATGCAGAGAGTGAAAGAGAATTCAGTAAAATCAAATTTAAGTCAGGCGACGAC 

. . - - V . . m . . * S40 

AT ATT rUTA CGTCTCTCA L 1 . * I CTCTTAAGTCA " ' 1"I VAU * 1 a A AATTCACTCCCCTCCTG 

\ 

TyrLysHisAIaGluSerGluArgGluPheSerLysIleLysPheLyaSerGlyAspAsp 

+ . + . ------- — . - -- -4. 

GGCTAT A TT T I I 1 ATCACGGTAAAG ACCCTTCCCGACAACTTCCCAL 1 1 ' C T GA AAAAGTT 

. + . A---* . * . «- . ♦ 600 

CCGATATAAAAAATAGTGCCATTrCTGGGAAGGGCTGTTGAAGGGTGAAGAL 1 1 1 1 1 CAA 

v 

GlyTyrIlePheTyrHisGlyLysAspProScrArgGlnl#euProThrSerGlul*ysVal 

\ 

. . - - — . — - . - .-.-4. 



ATCTACAAAGGCGTATGGCATTTTGTAACCCATACTGAAAAGGGACAAAAATTTAACGAT 

. ^ . + - . * . «. . * €60 

TAGATGTTTCCGCATACCGTAAAACATTGGCTATGA C TTTTCC LIUI 1 1 1 1 AAATTGCTA 

IleTyrLysGlyValTrpHisPheValThrAspThrGluLysGlyGlnLysPheAsnAsp 
m - . ♦ . + - 

A ' l 1 ' C ' l I GAAACCTCAAAAGGGCAAGGCGACAGATACAGCGG A ' l 11 1 CGGGCGATGACGGC 

. * . * . ♦ . *-- r - . ♦ . + 720 

TAAGAA C i ' 1 1 ' G GA U 1111 CCCGTTCCGCTGTCTATGTCGCCTAAAAGCCCGCTACTGCCG 

IleLeuGluThrSerLysGlyGlnGlyAspArgTyrSerGlyPheSerGlyAspAspGly 



GAAACAACTTCCAATAGAACTGA7TCCAACCTTAATGATAAGCACGAGGGTTATGGTTTT 

. , . . . . ♦ 730 

CTTTGTTG AAG GT7 ATCTTG ACT AAGGTTCjGAATTACTATTCGTG CTCCj? AATACCAAAA 

GXuThrThrScrAsnArgThrAspSerAsnl^uAsnAspLysKisGluGlyryrGlyPhe 



v 
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ACCTCGAA i a * AGAAGTGGA- ~ - C3GCAGTAAAAAATTCACGGGTAAA7TAATACGCAAT 

lllyi— rri"— :rz":::: * * a«o 

l0v,/ v w - - <rUvw - *- • .%A*w./wi^w^.-A ~r~"CA_ . - AATTATGCGTTA 

rhrSe*AsnI,euGIuValAspPheGlySer^^ 

AATAGAGTTACAAACGCTArTACTAACGATAAATACACCACCCAATACTACAGCCTTGAT 

" * * """"* "" " ' A t - ----- - - - . * • - - 900 

TTATCTCAATv* i ^ -OCGATGATGA* -3w*A.. * A7GTG3TGGGTTATGATGTCGGAACTA 

AsnArgValThrAsnA^ThrThrAsr^ 

GCCCAAATAACAGGCAACCGCTTCAACGGTAAGGCGATAGCGACCGACAAACCCGACACT 

_ x _ ♦ . » . + 960 

CGGGTTTATTGTCCGTTGGCGAAG7CGCCA— CCGC7A7CGCTGGCTGTTTCGGCTCTGA 

AiaGlnlleThrGlyAsnArcPheAsXciyLysAlalleAlaThrAspI-ysProAspThr 
- - - - * . - . - - *lfy( m ' • ~ -------- , , 

GGAGGAACCAAACTACATCC L 1 1 1 U * a I CCCACT^ U A 1 1 1 L 1 1 1 U AGCCGCGGCTTTTTC 

" * V - . ♦ . «. 1020 

CCTCCTTGGTTTGATGTAGGGAAACAAAGGCTGAGGAGAAGAAACTCGCCGCCGAAAAAG 

GlyGlyThrLysLeuHisProPheValSerAspSerSArSerLeuSerGlyClyPhePhe 

GGTCCGAAGGGTGAGGAATTGSGTTTCCCCTTTTTC^^ 

" • . V „. . * 1080 

CCAGGCTTCCCACTCCTTAACCCAAACCCCAAAAACTCGC^ 

GlyProl-ysGlyGluGluLeuGlyPheAxgPheLeuSerAspAspLy^LysValAlaVal 

GTCC^CACCGCGAAAACCAAAGACAAAACCGAAAATGGCGCGGTGGCTTCAGGtGGCACA 

. . * \ ♦ 1140 

CAGCCGTCGCGw "i i . .GGTTTCTGTTTTGCCi . i . ACCGCGCCACCCAAGTCCGOCGTGT 

ValGlySerAlaLvsTr.rLysAspLysTr.rGIuAsrX^ 
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k 



rGCGGCAGCATCAAACGGTGCGGCAGGCACGTCGT- . GAAAACAG7AAGCTGACCACG 

• . - - - . * 1200 

CTAfGCCGTCGTAGTTTGCCACGCCGTCCGTGCAGCAGAw i L . . w i CATTCGACTGGTGC 
lAI a S e r As r.G 1 yA 1 aAl aG i yTh rSerSerGl uAs nS erLysLe uT.^r Thr 



Gi i * .GGATG^GGTCGAG- . GAAATTGGGCGA7AAGGAAG7CCAAAAGCTCGACAACTTC 

■■■■ ; ■ . - • * . ♦ 1260 

CAAAACCTACGCCAGw . CGA* . . - AACCCGC7ATTCC77CAGG7TT7CGAGC7G7TGAAG 

ValLeuAspAlav)^GiuL*uLysLeuGlyAspLysGluVaIGlnLysLetiAspAsnPhe 



AGCAACGCCG CCCAACTG 0^ TG7CGACCCCA7TATGATTCCG C I C T TOCCCGAGACTTCC 

1320 

TCGTTGCGGCGGG7TGACCAACAGCTGCCGTAATACTAAGGCGAGAACGCGCTCTGAAGG 

\ 

SerAsnAlaAlaGlnl^euValValAsoGlvileMecileProLeuLeuProGluThrScr 

\ 

\ 




G AAAGTGGG AACAATCAAGCCAATCA^WSGTACAAATGG CGG AACAGCCTTTACCCGCAAA 

* . * . + 1380 

CTTTCACCCTTGTTAGTTCGGTTAGTTCCATCrrTTACCGCCTTGTCGGAAATGGGCGTTT 

GluSerGl yAsnAsnGlnAlaAsnGlnGl yThr AsnGl yGlyThrAl a PheThr ArgLy s 

\ 

\ 

\ 

T7TGACCACACGCCGGAAAGTGA7AAAAAAGACGCCCAAGCACGTACGCAGACGAATGGG 

. 1 ^ „. 1440 

AAACTGGTGTGCGGC C ♦ ' . 1 CACTA I II 1 1 i LA ^ CC UVl i CGTCCATGCO TCTGC I TACCC 
PheAspHisThrProGluSerAspLysLysAspAlaGltiAlaGlyThrGlnThrAsnGly 



G CGCAAACCG CTTC AAATACGGC AGGTG ATACCAATGGCAAAACAAAAACCTATG AAGTC 

* . ♦ . . . ♦ . ♦ 1500 

CGCGI 1 i 3GCGAAGTT7ATGC757CCAC7ATGGTTACC3 ± L + *l,U 1 1 J, GGATACTTCAG 

AlaGlr.ThrAlaserAsnThrAlaGlyAspThrAsnGlyLysThrLvsThrTvrGluVal 



GAAGTCTGCTGTTCCAAC— CA-.77 A7C7GAAA7ACGGAA7GT7GACGCGCAAAAAGAGC 

. ~ ^ „ . - 1560 

CI 1 C AGACGACAAGGT7GGAG77AA7AGACT 77ATGCCTTACAACT3CCCC 1 I 1 ■ 1G1 CG 

\ 

G 1 u Va 1 Cy s Cy s S ± r As r. LeuAs r.T/r LeuLysTyrGi yMe t le uTh r Arg Ly s As nS e r 
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A^S^CCG CGATG CAGG CAGGAGAAAGCAGTAGTCAAG C7GATGCTAAAACGGAACAACTT 

y * * 1520 

TTCAG^GCTACGTCGGTCCTCTTTCGTCATCAGTT rGACTACGATTTTGCCTTGTTCAA 

LysScr^aMetGirJ^laGlv-GluSerSerSerGlr-AlaAspAlaLysTfarGluGlnVal 

- - - - . - -A- . . - -- ------ - . -- — . - . + 

GGACAAAGTATwT^C~CC~3GCGAGCGCACCGATGAAAAACAGAT^ 

. ^ 16B0 

C CTGTTT CATAC AAG GAG GTTC C G CGTGG CTACTTTTTCTCTAAG GTTCG CT CGTT 

GlyGlnSerHetPhea^uGlnGIv-GluArgThrAspGIuLysGluIleProSerGluGln 

AACATCGTTTA7CGGSGG7L .% GGTACGGGCATATTGCCAGCAGGACAAGCTGGAGCGGC 

* 1740 

TTGTAGCAAATAGrrCCCAGAAOCATGCCCGTATAACGGTC G TC ^ , l GT i C GACCTCGCCG 
AsnIleValTyrAr=GlySerTrpT^r^?yHisIl€AlaScrSetThrScrTrpSerGly 

AATGCTTCTGATAAAGAGGGCGGCAACAGGGSpGAATTTACTGTCA A T T T T G GCCAGAAA 

. + 1800 

TTACGAAGACTATTTCTCCCGCC G TT G T CCCGCgTTAAATG ACACTTAAAACCGCT C I ' l 1 
AsnAl a Se r As pLy sG 1 uG 1 yG 1 yAsnArg Al aG 1 u^heThr Va 1 As n PheG 1 yG 1 uLys 

AAAATTACCGGCACGTTAACCCCTGAAAACAGCCAGGAGCC^C CTI ' AA CCATTGATGCT 

- -~ * 1860 

TTTTAATGGCCGTGCAATTGGCGACTTTTGTCCGTCCTCCGTTGrteAAATGGTAACTACCA 

LysIleTHrGlyThrLeuThrAlaGluAsnArgGlnGluAlaThrPheThrlleAspGly 

- . - - - - 4- 

AACATTGAGGCCAACG ^ 1 1 1 : ' . "GGTACGGC AAAAACTG CTGAATTAG U T\TT G ATCTC 

1920 

TTCTAACTCCCG— GCCAAAAAGGCCATGCCG i ■ i : 1 GACG ACTTAATCCAAAftCTAGAG 
LysIleGluGlyAsr.GlyPheSerGiyThrAlaLysThrAlaGluUeuGlyPhc^spLeu 
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GA x Cji»-%AAAAATAC CAC C CG C AC 3 C C - AAGG CATATATCACAGATG Z CAAGGTGCAGGGC 

CT/\^» a. TTTTTATGG~GGGC3T3CGGAT j rCCGTATATAGTG1*CTACGG » '-' CCACGT"CCCG 

v 

AspGl^ysAsnThrThrArgThr?roLysAl£T\'rIisThrAspAlaLysVaIGlnGly 



\ 



GGTTTTT ACG GG C C CAAAG C C GAAG AG TTG GG CGG ATG GTTT 3 C CT AT CAGGG CG AT AAA 
CCAAAAATGCGTGGGTTTCGGCTTCTCAACCCGCCTACCAAACGGATAGTCCCGCTATTT 
GlyPheTyrGly?roLysAlaGluGluLeuGlyGlyTrpPheAlaTyrGInGlyAspLys 



1980 



2040 



^/ww%Cuo/vrvA>\ i ACjtACA3T^TGCATCC*;GCAATGGAAATTC-— .GCAAGCAGTGCAACTGTC 



GTTTGCC I * ^ 1 ATGTTGTCAACGTAGG CCGTTACCTTTAAGTCGTTCGTC ACGTTGACAG 
GlnThrGluAsnThrThrValAla&BxGlyAsnGlyAsnScrAlaScrSerAlaThrVal 



2100

GTATTCGGTGCGAAACGCCAAAAGCCTGTGCAATAA
CATAAGCCACGCTTTGCGGTTTTCGGACACGTTATT
ValPheGlyAlaLysArgGlnLysProValGlnTer

(2) INFORMATION FOR SEQ ID NO: 39:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 692 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39:



